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Abstract 

' With a view to numerical applications we address the following question: given an 

ergodic Brownian diffusion with an unique invariant measure, what are the invariant 
measures of the duplicated system consisting of two trajectories? We mainly focus on 
the interesting case where the two trajectories follow the same Brownian path. In this 
case, we first show that uniqueness is essentially always true in the one-dimensional 
Oh . case. Then, in the multidimensional case, we build some explicit counter-examples 

C"| ' where the uniqueness property is not satisfied and then, give explicit conditions on 

the drift and diffusion coefficient functions (which can be interpreted as the negativity 
of a non-infinitesimal Lyapunov exponent associated with the dynamical system) to 
obtain uniqueness for invariant distribution of the duplicated system. 

As a main application, we investigate the Richardson-Romberg extrapolation for 
the numerical approximation of the invariant measure of the initial ergodic Brownian 
diffusion. 
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1 Introduction and motivations 

When one discretizes a stochastic (or not) differential equation (SDE) by an Euler scheme 
with step h, a classical method to reduce the discretization error is the so-called Richardson- 
Romberg (RR) extrapolation introduced in [TT90] for diffusion processes. Roughly speak- 
ing, the idea of this method is to introduce a second Euler scheme with step h/2 and to 
choose an appropriate linear combination of the two schemes to cancel the first-order 
discretization error. Such an idea can be adapted to the long-time setting. More pre- 
cisely, when one tries to approximate the invariant distribution of a diffusion by empirical 
measures based on an Euler scheme (with decreasing step) of the diffusion, it is also pos- 
sible to implement the same strategy by introducing a second Euler scheme with half-step 
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(see [Lem05]). In fact, tackling the rate of convergence of such a procedure involving a 
couple of Euler schemes of the same SDE leads to studying the long run behaviour of the 
underlying couple of continuous processes that we will call duplicated diffusion. When the 
two solutions only differ by the starting value and are driven by the same, say, Brownian 
motion, the resulting coupled process is (also known as 2-point motion (terminology com- 
ing from the more general theory of stochastic flows, see [BS88, Car85, Kun90]). Before 
being more specific as concerns this motivation, let us now define precisely what we call a 
duplicated diffusion. 

Consider the following Brownian diffusion solution to the stochastic differential equation 

{SDE) = dX t = b(X t )dt + a(X t )dW t , X = xe R d , (1.1) 

where b : R rf — > R d and a : M. d — > M(d,q,M) (d x q matrices with real valued entries) 
are locally Lipschitz continuous with linear growth and W is a standard g-dimensional 
Brownian motion defined on a filtered probability space (J-t)t>o) (satisfying the 

usual conditions). This stochastic differential equation (SDE) has a unique strong solution 
denoted X x = (Xf)t>o- Let p G M(q,q,M) be a square matrix with transpose p* such 
that I q — pp* is non-negative as a symmetric matrix. We consider a filtered probability 
space, still denoted (Q,A,F, (J^)t>o) on which is defined a 2(7-dimensional standard (J r t)- 
Brownian motion denoted (W, W) so that W and W are two independent (/-dimensional 
standard (J r t)-Brownian motions. Then we define a third standard g-dimensional 

(J-i)-Brownian motions by 

W ip) = p*W + ^/I q -p*pW, 

which clearly satisfies 

(W\W ( ^ j ) t = Pij t, t>0 

(the square root should be understood in the set of symmetric non-negative matrices). 
The duplicated diffusion or "duplicated stochastic differential system" (DSDS) is then 
defined by 

mqnq , = fdX t = b(X t )dt + a(X t )dW t , X = Xl GR d , 

{ ' \dX { t p) =b{xl p) )dt + a(x[ p) )dWi P \ X { p) 

Under the previous assumptions on b and a, (1.2) has a unique (strong) solution. Then 
both (Xf)t> and (X? 1 , X^ X2 )t>o are homogeneous Markov processes with transition 
(Feller) semi-groups, denoted (P t (x,dy)) t > and (Q ( f\(x 1 ,x 2 ), (dyi,dy 2 ))) t>Q respectively, 
and defined on test Borel functions / : M. d — > R and g : R 2d — > R, by 

P t (f)(x) = Ef(Xn and Q { t p) (g)( Xl ,x' 2 = Eg(X? , x[ p) ^). 

We will assume throughout the paper that the original diffusion X x has an unique 
invariant distribution denoted v i.e. satisfying vPi = v for every t€ R+. The first part of 
the paper is devoted is devoted to determining what measures are the invariant measures 
of (Q\ )t>o (if any) depending on the correlation matrix p. Thus, if p = 0, it is clear that 
v ® v is invariant for and if p = I q so is z^a = v ° (x i— > (x,x)) _1 , but are they the 
only ones? To be more precise, we want to establish easily verifiable criterions on b and 
a which ensure that z^a is the unique invariant distribution of (DSDS). In the sequel, 
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we will denote by \x a generic invariant measure of Q^ p \ We present the problem in more 
details (including references to the literature at the end of this introduction). 

> Existence of an invariant distribution for (Q^)t>o- First, the family of probability 
measures (/4 P) )t>0 defined on (M. d x M. d , Bor(R d )® 2 ) by 

lA P) = 7 f v® 2 (dxi,dx 2 )Q < f\(x 1 ,x 2 ),(dy 1 ,dy 2 ))ds 
t Jo 

is tight since both its marginals on M. d are equal to v. Furthermore, the semi-group 
(Qt^)t>o being Feller, one easily shows that any of its limiting distributions //^ as t — > oo 
is an invariant distribution for (Q^)t>o such that p^ p \dx x R. d ) = p^(R. d x dx) = u{dx). 
Also note that, if uniqueness fails and (Pt)t>o has two distinct invariant distributions v 
and v 1 ', a straightforward adaptation of the above (sketch of) proof shows that (Q[ p ^)t>o 
has (at least) an invariant distribution with marginals (v, v') and another with (v' ,v) as 
marginals. 

\> Uniqueness of the invariant distribution of (Q[ P ^)t>o- It is clear that in full generality 
the couple may admit several invariant distributions even if X has only one 

such distribution. So is the case of a fully degenerate setting (a = 0) if the flow $(x,t) 
of the ODE = x = b(x) has as a unique repulsive equilibrium and a unique invariant 
distribution v on M d \{0}. Then both distributions v® 2 and z^a (defined as above) on (M. d \ 
{0}) 2 are invariant and if u is not reduced to a Dirac mass (think e.g. to a 2-dimensional 
ODE with a limit cycle around 0) (DSDS) has at least two invariant distribution. 

In the non-degenerate case (a ^ 0) the situation is more involved and depends on the 
correlation structure p between the two Brownian motions W and W^ p \ The diffusion 
matrix E(X t Xl , X { t p) ' X2 ) of the couple (X Xl ,X^> X2 ) at time t > is given by any continuous 
solution to the equation 



*(6)/fV*(£l) <7<7*(&) 



(e.g. the square root in the symmetric non-negative matrices or the Choleski transform. . . ). 

First, note that if I q — p*p is positive definite as a symmetric matrix, it is straight- 
forward that ellipticity or uniform ellipticity of <rcr*(when q > d) for X x is transferred 
to ^(X xl ,X { t p) ' X2 )^(X Xl ,X { t p) ' X2 )* for the couple (X X \X^' X2 ). Now, uniform ellipticity, 
combined with standard regularity and growth/boundedness assumption on the coeffi- 
cients b, a and their partial derivatives, classically implies the existence for every t > 
of a (strictly) positive probability density pt(x,y) for Xf . These additional conditions 
are automatically satisfied by the "duplicated coefficients" of (DSDS). At this stage, 
it is classical background that any homogeneous Markov process whose transition has a 
(strictly) positive density for every t > has at most one invariant distribution (if any). 
Consequently, under these standard assumptions on b and a which ensure uniqueness of 
the invariant distribution v for X, we get uniqueness for the "duplicated" diffusion process 
as well. 

The hypo-elliptic case also implies the existence of a density for Xf and the uniqueness 
of the invariant distribution under controllability assumptions on a companion differen- 
tial system of the SDE. This property can also be transferred to (DSDS), although the 
proof becomes significantly less straightforward than above (see Appendix B for a precise 
statement and a detailed proof). 
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The second setting of interest, p = I q , corresponds to W w = W so that X^^ 2 = 
X X2 i.e. (DSDS) is the equation of the 2-point motion in the sense of [Kun90] section 
4.2, [Har81]. This 2-point motion has been extensively investigated (see [Car85]) from 
an ergodic viewpoint, especially when the underlying diffusion, or more generally the 
stochastic flow <&(uj,x,t) lives on a (smooth) compact Riemannian manifold M. When 
this flow is smooth enough in x, the long run behaviour of such a flow (under its steady 
regime) can be classified owing to its Lyapunov spectrum. For what we are concerned 
with, this classification is based on the top Lyapunov exponent defined by 

A s := limsup-log||D a; $(x,t)|| 

t— >+oo t 

where ||-D2;<]?(x,t)|| denotes the operator norm of the differential (tangent) of the flow. 
In this compact setting and when the top Lyapunov exponent is positive, the long run 
behaviour of the two-point on M 2 \A has been deeply investigated in [BS88] (see also 
[DKK04] for further results in this direction). Such assumption implies that A is somewhat 
repulsive. 

Here, we are in fact concerned with the opposite case because, we need in some sense 
that the paths cluster asymptotically either in a pathwise or in a statistical sense. When 
A s < 0, such clustering result has been obtained in [Car85] (see Proposition 2.3.3) in a 
local form. It is shown that the "asymptotic clustering" holds with an arbitrarily high 
probability, provided the starting points are close enough. However, this result seems to 
be not sufficient to imply uniqueness of the invariant distribution for the two-point motion 
and is still in a compact setting. 

We will use a somewhat similar approach in the section devoted to the multidimensional 
framework and obtain some sharp verifiable criterions on b and a for uniqueness of the 
invariant distribution of the two-point motion. In fact, we will see that, under some 
conditions, these criterions can be read as the negativity, or even non-positivity, of a 
non-infinitesimal Lyapunov exponent (NILS) in the neighbourhood of the diagonal (see 
Section 3.4 for details). 

Note that these criterions obtained in the case p = I q can be easily extended to the 
(last) case p* p = I q using that W w = pW is still a standard B.M. (think to p = — 1 
when d = 1). For the sake of simplicity (and since it is of little interest for the practical 
implementation of the Richardson- Romberg extrapolation), we will not consider this case 
in the paper. 

The second part of the paper is devoted to a first attempt in a long run ergodic 
setting to combine the Richardson-Romberg extrapolation with a control of the variance 
of this procedure (see [Pag07] in a finite horizon framework). To this end we consider 
two Euler schemes with decreasing steps 7 n and 7 n satisfying j2n-i = l2n = 7n/2 and 
p-correlated Brownian motion increments. We show that the optimal efficiency of the 
Romberg extrapolation in this framework is obtains is obtained when p = 0, at least when 
the above uniqueness problem for is satisfied. To support this claim we establish a 
Central Limit Theorem whose variance is analyzed as a function of p. 

The paper is organized as follows: in Section 2, we establish a one-dimensional result 
which grants the uniqueness of as an invariant distribution for (DSDS). We also 
prove that the paths of the (DSDS) are asymptotically confluent (slightly extending 
by a different method a result by Has'minskii in [Has80]). Section 3, is devoted to the 
multidimensional framework. We first provide a simple counter-example to uniqueness and 
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then provide rather general but operating criterions, all based on the behavior of the one- 
dimensional Ito process {X^ 1 —X^ 2 \ and closely related in spirit to the Feller classification 
for one-dimensional diffusions. They can be seen as asymptotic weak confluence criterions. 
Section 4 is devoted to Richardson-Romberg extrapolation for the recursive computation of 
the invariant measure of a diffusion by weighted occupation measures of an Euler scheme 
with decreasing step: we show how to improve the (weak) rate of convergence of the 
procedure by implementing a Richardson-Romberg (RR) extrapolation involving two Euler 
schemes with different steps provided i^a is the unique invariant distribution of the two- 
point motion. 

Notations. • \x\ = \/xx* denotes the canonical Euclidean norm of xS l rf (x* transpose 
of the column vector x). 

• m|| = y / Tv{AA*) if A G M(d,q,R) and A* is the transpose of A (which is but the 
canonical Euclidean norm on R d2 ). 

• A ffi 2d = {(x,x), xG R d } denotes the diagonal of R 2d . 

• S(d,R) = {S G M{d,d,R), S* = S}, S + {d,R) the subset of S(d,R) of non-negative 
matrices, S ++ (d, R) denotes the subset of positive definite such matrices and yS denotes 
the unique square root of Sg S + (d, R) in S ++ (d, R) (which commutes with S). x (g> y = 
xy * = [x iyj }€ M{d,d,R), x,ye R d . 

• If S G S ++ (d, R), we denote by ( . | . ) s and by | . | s , the induced inner product and 
norm on R d , defined by (x\y) s = (x\Sy) and \x\ 2 s = (x\x) s respectively. Finally, for 
Ae M(d, d,R), we set \\A\\ 2 S = Tr(A*SA). 

ngd) 

• A*n fJ- denotes the weak convergence of the sequence (/i n )n>i of probability measures 



defined on (R , Bor(R )) toward the probability measure \x. 

• For every function / : R d — > R, define the Lipschitz coefficient of / by [/]Lip = 

su Px^y \ x - y \ ^ 

2 The one-dimensional case 

We first show that, in the one-dimensional case d = q = 1, uniqueness of v implies that 
i^Ai as defined in the introduction, is the unique invariant distribution of the duplicated 
diffusion. The main theorem of this section is Theorem 2.1 which consists of two claims. 
The first one establishes this uniqueness claim using some ergodic-type arguments. Note 
that we do not require that a never vanishes. The second claim is an asymptotic pathwise 
confluence property for the diffusion in its own scale, established under some slightly more 
stringent assumptions involving the scale function p, see below. This second result, under 
slightly less general assumptions, is originally due to Has'minskii (see [Has80], Appendix 
to the English edition, Theorem 2.2, p. 308). It is revisited here by different techniques, 
mainly comparison results for one dimensional diffusions and ergodic arguments. Note 
that uniqueness of i^a can always be retrieved from asymptotic confluence (see Remark 
2.1). 

Before stating the result, let us recall some definitions. We denote by M the speed 

measure of the diffusion classically defined by M(d^) = tt , . , where p is the scale 

(crV)(£) 

function defined (up to a constant) by 

p{x)= f d£e~ J *o^ u)du , xG R. 
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Obviously, we will consider p only when it makes sense as a finite function (so is the case 
if b/a 2 is locally integrable on the real line). We are now in position to state the result. 

Theorem 2.1. Assume that b and a are continuous functions on R being such that strong 
existence, pathwise uniqueness and the Feller Markov property hold for (SDE) from any 
x S R. Assume furthermore that there exists A : R+ —> R+, strictly increasing, with 
A(0) = and f Q+ X(u)~ 2 du = +00 such that for all x,y G R, \cr(y) — a(x)\ < X(\x — y\). 
Then, the following claims hold true. 

(a) // (Xt)t>o admits a unique invariant distribution v , the distribution V{± = v o (£ 1— >■ 
(£>£)) * s ^ e unique invariant measure of the duplicated diffusion [Xf 1 ,X^ 2 )t>o- 

(6) (Has'minskii) Assume that the scale function p is well-defined as a real function on 
the real line and that, 

lim p(x) = ±00 and M is finite. 

x— >±oo 

Then, v = M/M(R) is the unique invariant distribution of (Xt)t>o and (p(Xt))t>o is 
pathwise confluent: F-a.s., for every x\, X2 € R, p(X xl )—p(X X2 ) tends to whent — > +00. 

Remark 2.1. > The general assumptions on b and a are obviously fulfilled whenever 
these functions are locally Lipschitz with linear growth. 

D> The proofs of both claims are based on (typically one-dimensional) comparison argu- 
ments. This also explains the assumption on a which is a classical sufficient assumption 
to ensure comparison of solutions, namely, if x\ < X2, then X^ 1 < X^ 2 for every t > a.s. 
(see [IW77]). 

> The additional assumptions made in (b) imply the uniqueness of v (see the proof below). 
The uniqueness of the invariant distribution v\ for the duplicated diffusion follows by 
(a). However, it can also be viewed as a direct consequence of the asymptotic pathwise 
confluence of p(X^'), i = 1,2 as t — > +00. Actually, if for all xi,x% G M. d , p{Xf 1 ) — 

a.s, we deduce that for any invariant distribution \x of (X Xl , X X2 ) and 

every K > 

[ (Wi) -p(x 2 )\ AK) f i(dx 1 ,dx 2 ) < lim sup 7 / ¥,J\p{X x s 1 )-p(X x s 2 )\f\K)ds = Q. 

JM. V ' i^+oo t J \ / 

As a consequence, p{x{) = p(x 2 ) ii{dx\,dx2)-a.s. Since p is an increasing function, it 
follows that fi({(x,x),x£ R}) = 1 and thus that [i = v^. 

> As mentioned before, (b) slightly extends a result by Has'minkii obtained in [Has80] with 
different methods and under the additional assumption that a never vanishes (whereas we 
only need the scale function p to be finite which allows e.g. for the existence of integrable 
singularities of \). Note however that the case of an infinite speed measure M (which 
corresponds to null recurrent diffusions) is also investigated in [Has80], requiring extra 
non-periodicity assumptions on a. 

Proof, (a) Throughout the proof we denote by (X* 1 , X^ 2 ) the duplicated diffusion at time 
t > and by (Qt((x%, x 2 ), dy%, dy2))t>o its Feller Markov semi-group. The set Idsds of 
invariant distributions of (Qt)t>o is clearly nonempty, convex and weakly closed. Since 
any such distribution [i has v as marginals (in the sense \i{dx\ x R) = /i(R x dx 2 ) = v), the 
set Idsds is tight and consequently weakly compact in the the topological vector space 
of signed measures on (R 2 , £>or(R 2 )) equipped with the weak topology. As a consequence 
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of the Krein-Millman Theorem, Idsds admits extremal distributions and is the convex 
hull of these extremal distributions. 

Let n be such an extremal distribution and consider the following three subsets of M? : 

A + = {(xi,x 2 ), X2 > xi}, A~ = {(xi,X2), x\ > X2} and Ao = {(x,x), xG R} = A R 2. 

We first want to show that if ^{A + ) > then the conditional distribution fj, A+ defined by 

fi A+ = M ^4+) ' is a l so an invariant distribution for (Qt)t>o- Under the above assumptions 

on b and a, one derives from classical comparison theorems and strong pathwise uniqueness 
arguments for the solutions of (SDE) (see e.g. [IW77]) that 

V(ari,x 2 ) G C A + = R 2 \A+, Q t {(x x , x 2 ), c A + ) = 1. 

We deduce that for every (x 1 ,x 2 ) G R 2 and t > 0, 

Q t ((x 1 ,x 2 ),A+) = P((Xf\Xf)G A+) = 1 a+ (xi,x 2 )F{t x1>X2 > t) 

where t XiX2 = inf{i > 0,X^ 2 < X^ 1 }. The second equality follows from the pathwise 
uniqueness since no bifurcation can occur. Now, let \i G Idsds- Integrating the above 
equality and letting t go to infinity implies 

fj.(A + ) = / ii(dxi,dx 2 )'F(T XUX2 = +00). 

JA+ 

If n(A + ) > 0, then [i(dxi, x 2 )-a.s. ¥(t XuX2 = +00) = 1 on A + i.e. X* 2 > X? 1 for every 
t > a.s.. As a consequence, /j,(dxi,dx 2 )-a.s., for every B G B(M. 2d ), 

1 (x 1 ,x 2 )aA+Qt{{xi,X2),B) = l ixuX2)( , A +Q t ((xi,x 2 ),B n A + ) = Q t {(xi, x 2 ), B n A + ) 

where we used again that Qt{(xi, x 2 ), A + ) = if X2 < x\. Then, since (i is invariant, we 
deduce from an integration of the above equality that 

Ijl(BC\A + ) = I Q t ((x 1 ,x 2 ),B)lr XlX2 ) eA+ fj,(dx 1 ,dx 2 ). 
It follows that if /i(^4 + ) > 0, fi A+ is invariant. 

If n(A + ) < 1, one shows likewise that f/ A+ an invariant distribution for (Qt)t>o as well. 
Then, if ^(A + ) G (0, 1), then \i is a convex combination of elements of Idsds 

l i = l i{A+) l i A+ +^A+)f/ A+ 

so that n cannot be extremal. Finally n(A + ) = or 1. 

Assume = 1 so that \i = fi(. n A + ). This implies that Xq > Xq P„-a.s.. But 

/j, being invariant, both its marginals are v i.e. Xq and Xq are ^-distributed. This yields 
a contradiction. Indeed, let <p be a bounded increasing positive function. For instance, 
set <p{v) := 1 + MGR. Then, E[ip(X^) - <p(X^)] > since Xq 1 > X5 2 F^-a.s. but 

we also have E[(£>(Xq) — ip^Xfi)] = since Xq and Xq have the same distribution. This 
contradiction implies that n(A + ) = 0. 

One shows likewise that fi(A~) = if \x is an extremal measure. Finally any extremal 
distribution of Idsds is supported by A§ = A R 2. Given the fact that the marginals 
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of fi are v this implies that \x = v& = v o (x i— > (x,x)) 1 which in turn implies that 
Zdsds = {^a}- 

(b) Since the speed measure M is finite and a never vanishes, the distribution i^(d£) = 
M(d£)/M(R) is the unique invariant measure of the diffusion. Thus, by (a), we also have 
the uniqueness of the invariant distribution for the duplicated diffusion. Let aq, x 2 e R. If 
aq > x 2 then X X1 > X X2 , still by a comparison argument, and p(X Xl ) > p(X X2 ) since p is 
increasing. Consequently M Xl ' X2 = p(X X1 ) — p(X X2 ), t > 0, is a non-negative continuous 
local martingale, hence P-a.s. converging toward a finite random limit £^' X2 > 0. One 
proceeds likewise when aq < x 2 (with t£ X2 < 0). When aq = x 2 , M t = ^' X2 = 0. The 
aim is now to show that €d^ X2 = a.s. To this end, we introduce 



1 /■* 

Vt{dyi,dy 2 ) := - Q s {(x 1 ,x 2 ),dy 1 ,dy 2 )ds, (xi,x 2 ) 
r JO 



t>2(i 



and we want to check that for every (aq,^) G R M , (nt(dyi,dy 2 ))t>i converges weakly to 
i^A- Owing to the uniqueness of z^a established in (a) and to the fact that any weak limiting 
distribution of (nt(dyi, dy 2 ))t>i is always invariant (by construction), it is enough to prove 
that (fj,t(dyi, dy 2 ))t>\ is tight. Since the tightness of a sequence of probability measures 
defined on a product space is clearly equivalent to that of its first and second marginals, 
it is here enough to prove the tightness of (t^ 1 f Q * P s (xo, dy)ds)t>\ for any xq € R. 
Let xq £ R. Owing to the comparison theorems, we have for all t > and M 6 R, 
P t (xo,[M,+oo)) < P t (x,[M, +00)) if x > x and P(x , (-00, M]) < P t (x, (-00, M]) if 
xo > x. Since z/ is invariant and equivalent to the Lebesgue measure, we deduce that 

P t( x ,[M, + oo))< 4f^ll and P t(xo , ( _oo,M))<4^M. 

z/([x ,+oo)) z/((-oo,x J) 

The tightness of (Pj(xo, dy))t>\ follows (from that of v) and we derive from what preceeds 
that 

V (xi,x 2 ) € R 2d , -/ Q s {(xi,x 2 ),dyi,dy 2 )ds^=l i>A(dyi,dy 2 ). 
t Jo 

Now, note that for every Le N, the function g L : (yi,y 2 ) >->■ |p(yi) — p(y 2 )| AL is continuous 
and bounded. Hence by Cesaro's Theorem, we have that 

^^Q s (g L )(x 1 ,x 2 )ds = ^j\g L (X x \X X2 )ds^E(\e x ^ X2 \ A L) 

whereas, by the above weak convergence of ([J>t{dyi,dy 2 ))t>i, we get 

7/ Qs(g L )(xi,x 2 )ds — > I g L (yi,y2)vA(dyi,dy 2 ) = as t ->• +00 

t JO JR d 
since g L is identically on A R2 d. It follows, by letting L go to infinity, that 

EIO^I = 0. 

This implies t^ 2 = P-a.s. which in turn implies that 

P-a.s. p(X Xl ) - p{Xf 2 ) ^ as i +00. 

Finally, it remains to prove that we can exchange the quantifiers, i.e. that P-a.s., p(X Xl ) — 
p{X X2 ) — > for every aq, x 2 . Assume that aq > x 2 . Again by the comparison theorem 



and the fact that p increases, we have < p(X Xl ) -p(X X2 ) < p(x\ Xl ^ +1 ) -p(x\ X2 ^). This 
means that we can come down to a countable set of starting points. □ 

In the continuity of the second part of Theorem 2.1(6), it is natural to wonder whether a 
one-dimensional diffusion is asymptotically confluent, i.e. when for all x\,x% G R, X Xl — 
X X2 tends to a.s as t — >■ +oo. In the following corollary, we show that such property 
holds in a quite general setting. 

Corollary 2.1. (a) Assume the hypothesis of Theorem 2.1(b) hold. If furthermore, 



f x b 

and limsup / — < +oo 



a never vanishes 

\x\ 

then, P-a.s., for every x±, X2G R, 

x x x _ x x 2 _^ afj +oo _ 

(b) The above condition is in particular satisfied if there exists M > such that for all 

\x\ > M sign(x)6(x) < 0. 

Proof, (a) Under the assumptions of the theorem, p is continuously differentiable on R 
and x 

/ / f x 2b 

Then it is clear that p[ n{ = infp'(x) > iff limsup / —o(Qd£ < +00. By the fundamental 

\x\-++ooJx a 

theorem of calculus, we know that, 



p 

and the result follows from Theorem 2.1(b) . 

(b) Since a never vanishes, p" is well-defined and for every x £ R, p"(x) = — —r^r^ ■ 
Using that p' is positive, we deduce from the assumptions that 

3M > such that 



p"(x) > 0, x > M 
p"(x) < 0, x < — M. 



Now, p' being continuous, it follows that p' attains a positive minimum p' inin > 0. □ 
Examples. 1. Let U be a positive a twice differentiable function such that lim U(x) = 

\x\—>+oo 

+00 and consider the one-dimensional Kolmogorov equation 

dX t = -U'{X t )dt + adW t 

where a > 0. Then, 

2 

lim inf xU'(x) > — => - A^f a.s. 

I a; I — >+oo 2 

Note that in particular, this result holds true even if U has several local minimas. 
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2. Let a = M — > (0, +00) be a locally Lipschitz continuous function with linear growth so 
that the SDE 

dX t = a(X t )dW t 

defines a (Markov) flow (Xf)t>o of local martingales. If - G L 2 (R, Bor(M.), A) then there 
exists a unique invariant measure v{d£,) = c a ^zjfs and (X^)t>o, i = 1,2 is pathwise 
confluent (in the sense of Theorem 2.1(6)) since p(x) = x. 

Note that the linear growth assumption cannot be significantly relaxed since a sta- 
tionary process cannot be a true martingale which implies that v has no (finite) first 
moment. 



3 The multidimensional case 

In this section, we begin by an example of a multidimensional Brownian diffusion (X X1 , X 12 ) 
for which v& (image of v on the diagonal) is not the only one invariant distribution. Thus, 
Theorem 2.1 is specific to the case d = 1 and we can not hope to get a similar result for 
the general case d > 2. It is of course closely related to the classification of two-point 
motion on smooth compact Riemannian manifolds since the unit circle will turn out to be 
a uniform attractor of the diffusion. 



3.1 Counterexample in 2-dimension 

Roughly speaking, saying that is the only one invariant distribution means in a sense 
that Xf — X\ has a tendency to converge towards when t — > +00. Thus, the idea in the 
counterexample below is to build a "turning" two-dimensional ergodic process where the 
angular difference between the two coordinates does not depend on t. Such a construction 
leads to a model where the distance between the two coordinates can not tend to (Note 
that some proofs are deferred to Appendix B). We consider the 2-dimensional SDE with 

Lipschitz continuous coefficients defined by, Vx 6 M 2 



b(x) 
a{x) 



xl {o<M<i} ~ 
#Diag(6(x)) + 



L{|x|>l 



} )(i-N) 



cx 



-cx 
0, 



where c£ (0, +00) are fixed parameters. 

Switching to polar coordinates X t = (r t cosipt,r t siinp t ) 
SDE also reads 

dr t = min(r t ,l)(l - r t ){dt + ddW}), 
dip t = cdW?, ^ G[0,2vr), 



t E 



r € 



we obtain that this 

(3.3) 
(3.4) 



where xo = ro(cos po, sin <po) and W = (W 1 ,W 2 ) is a standard 2-dimensional Brownian 
motion. 

Standard considerations about Feller classification (see Appendix B for details) show that, 
if x / (i.e. r > 0) and #e (0, y/2) then 



1 as t — > +00, 



(3.5) 
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while it is classical background that 

1 f* 

P-a.s. \/(p £R + , -J S^+^ds =>• A 5l as t ->• +00 

where Si denotes the unit circle of M 2 . Combining these two results straightforwardly 
yields 

VxG R 2 \ {(0,0)}, F-a.s. -f 6 X fds^3-\ Sl as t -> +00. 

On the other hand, given the form of (ft , it is clear that if x = rQe lLpo and x' = r' e itp 'o, 
ro, r' / 0, ip , ip' £ [0,2vr), then 

lim \X?-X?\ = le^-Vo) - 1| 

which in turn implies that 

lim - / \X X S - Xf\ds = le^-^o) - 1| P-a.s. 

This limit being different from as soon as (po (p' , one derives, as a consequence, that 
z^A cannot be the only invariant distribution. In fact, a more precise statement can be 
proved. 

Proposition 3.1. (a) A distribution fi is invariant for the semi-group (Qt)t>o of the 
duplicated diffusion if and only if (i has the following form: 

H = £(e ie ,e i( - e+v ^ (3.6) 

where © is uniformly distributed over [0, 2n] and V is a [0, 2ir) -valued random variable 
independent of 6 

(6) When V = a.s., we retrieve v& whereas, when V also has uniform distribution on 
[0, 2ir], we obtain u (g> v. Finally, fj, is extremal in the convex set of (Qt)t>o invariant 
distributions if and only if there exists 6q £ [0, 2w) such that V = 8q a.s. 

The proof is postponed to Appendix B. However, note that the claim about extremal 
invariant distributions follows from the fact that for every 6 G [0, 27r), (Qt)t>o leaves the 
set T e := {(e^,e^') € <Si x Si,(f/ - tp = 0mod.2vr} stable. 

Remark 3.2. In the above counterexample, the invariant measure of (rt)t>o is the Dirac 
measure 5±. In fact, setting again x = Eoe tLpo and x' = r^e^o and using that Xf — Xf = 
r x ^ e i( V0 +w?) _ e i( v ' +w?)^ + ( r x _ r x'y(<p' +w?)^ an eagy adaptation of the above proof 

shows that it can be generalized to any ergodic non-negative process (rt)t>o solution to 
an autonomous SDE and satisfying the following properties: 

• Its unique invariant distribution ir satisfies 7r(M?j_) = 1. 

• For every x,y£ (0, +00), rf — r\ — > a.s. as t — > +00. 

For instance, let {Xf )t>o be an Ornstein-Uhlenbeck process satisfying the SDE dXt = 
—Xfdt + adWt,Xo = x. Set rf = (Xf ) 2 (this is a special case of the Cox-Ingersoll-Ross 
process). The process (rf ) clearly satisfies the first two properties. Furthermore, {Xf ) t >o 
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satisfies a.s. for every x,y£ M. and every t > 0, \Xf — Xf\ = \x — y\e t . Then, since for 
every 



t t 



X x ds + a > a.s. as t — > +oo, 



o 



it follows that (rf )t>o also satisfies for all positive x, y, rf — r\ — > a.s. as t — > +oo 
(Many other examples can be built using Corollary 2.1). 

Finally, note that if n = C{Re l ® , Re % (® +V ^) where R, and V are independent random 
variables such that the distributions of R and are respectively n and the uniform dis- 
tribution on [0, 2ir] and V takes values in [0, 2ir), then \i is an invariant distribution of the 
associated duplication system. 

3.2 Uniqueness of the invariant measure: (S, #)-confluence 

In the sequel of this section, we propose criterions for the uniqueness of the invariant 
distribution of the duplicated system in the multidimensional case. The underlying idea 
of the criterions discussed below is to analyze the coupled diffusion process (X Xl ,X X2 ) 
through the squared distance process rt = \X Xl — X X2 \ 2 S (where | . | s is an Euclidean norm 
on M. d ). It is somewhat similar to that of Has'minskii's test for explosion of diffusions in 
W 1 or to the one proposed in Chen and Li's work devoted to the coupling of diffusions 
(see [CL89]). We begin by a general result under an assumption depending on a continuous 
(non-explicit) function 9 : (0, +oo) —> R+. Then, more explicit criterions are derived in the 
next subsections. In particular one involves a kind of bi-variate non-infinitesimal Lyapunov 
exponent. 

Theorem 3.2. Assume that b is continuous, a is Lipschitz continuous and that strong 
existence, pathwise uniqueness and the Feller Markov property hold for (SDE) from any 
x G R d . A ssume that the set X$de of invariant distributions of SDE is (nonempty, 
convex and) weakly compact. Furthermore, assume the SDE (1.1) is asymptotically (S, 9)- 
confluent in the following sense: there exists a continuous function 9 : (0, +oo) —> such 
that 

^) Inn sup / aw < +oo. 

u-s>0+ Ju w 
(ii) 3 S G S ++ (d, M) such that V x,y G R d , x / y, 

(b(x) - b{y)\x - y) s + ||K*) - a(y)\\l < 9(\x - y| 2 g ) (o*(x) - a*(y))f^ 



2 



(a) Weak confluence (Uniqueness of both invariant distributions): Then, iflDSDS denotes 
the set of invariant distributions of the duplicated system (DSDS), one has 

Isde = {^} and Idsds = {^a}- 
keeping in mind that v/± = v o (x i— > (x, x))^ 1 . 

(b) Pathwise confluence: If furthermore, / » dv < +oo and for every x G M , 

t 

— I P s (x,dy)ds) is tight, we have a.s. pathwise asymptotic confluence: 
t Jq Jt>i 



Vsi, x 2 G M d , X? 1 - X? 2 — ► as t -> +oo F-a.s. (3.7) 
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Remark 3.3. > If we also assume in (a), that ^— J P s (x,dy)ds S j is tight then, so 

is f— / Q s (x,x' ,dy,dy')ds] . Since, by construction, the weak limiting distributions of 
\t J Jt>i 

1 [* 

this sequence as t — > +00 are invariant distributions, this implies that - / Q s (x, x , dy, dy')ds 

t Jo 

weakly converges to f a as M +00. This explains the "weak confluence" terminology. 

> In both statements, Assumption(iz) must be combined with an additional assumption on 
6 to generate some weak or pathwise confluence properties (some easier to verify criterions 
will be given in the next subsection). Since pathwise confluence implies uniqueness of the 
invariant distribution for (DSDS), one can expect that the additional condition on 6 of 
(6) implies Assumption (i). This is true in full generality. Actually, owing to the Cauchy 

criterion, J Q e J « ™ dv < +00 implies that Ju e J « ™ dv — > as u — > . Using that 

v 1 — y e J " w is non-increasing on (0,1], it follows that ue J ^ ™ — )• + as u — >• 0. 
Taking the logarithm yields J 1 ^^—^-dw — > —00 and thus, Assumption (i). 

> If b and a are Lipschitz continuous, Kunita's Stochastic flow theorem (see [Kun90], 
Section 4.5) ensures in particular that, if x\ 7^ X2, the solutions X^ 1 and X^ 2 a.s. never 
get stuck. Taking advantage of this remark slightly shortens the proof below. 

t> Tightness criterions of ( — [ P s (x, dy)ds] for every x G M. d usually rely on the mean- 

\t J Q Jt>i 

reversion property of the solutions of (SDE) usually established under various assumptions 
involving the existence of a so-called Lyapunov function V going to infinity at infinity 
and such that AV is upper-bounded and limsup\ x ^ +OQ AV(x) < where A denotes the 
infinitesimal generator of X x (so-called Has'minskii's criterion). Keep in mind that 

AV(x) = (6|VV)(a?) + -Tr(aa*(x)D 2 V(x)j 

where Tr(^4) stands for the trace of the matrix A. 

On the other hand, a classical criterion for pathwise asymptotic confluence (a.s. at 
exponential rate, see e.g. [BB92], [Lem05] and often known as asymptotic flatness) is 

Vx, yeR d , (b(x)-b(y)\x-y) + ^\\a(x)-a(y)\\ 2 <-c\x-y\ 2 , (c > 0), 

and, as a straightforward consequence, uniqueness of the invariant distribution v of (SDE) 
(and of (DSDS) as well). Moreover, putting y = in the above inequality straightfor- 
wardly yields real coefficients a > 0, /3 > such that AV < (3 — aV with V(x) = \x\ 2 . 
Hence Has'minskii criterion is fulfilled, so it it is also an existence criterion for the invariant 
distribution which (S, #)-confluence is not. 

But in fact, the (S, #)-confluence criterion turns out to be a much weaker condition to 
ensure the uniqueness of the invariant distribution v of (SDE) (and of z^a for (DSDS)). 
This is emphasized by several operating criterions derived from this ^-confluence assump- 
tion. Finally, combined with the tightness of the occupation measure of the semi-group, 
it becomes a criterion for a.s. pathwise asymptotic confluence. 

Proof. Step 1: Exactly like in the beginning of the proof of Theorem 2.1(a), one checks 
that the set Idsds of invariant distributions of (Qt)t>o is a nonempty, convex and weakly 
compact subset of V(M. 2d ). As a a consequence of the Krein-Millman theorem, Tdsds has 
extremal distributions (and is their closed convex hull). 
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On the other hand, it follows from strong uniqueness theorem for SDE's that the 
semi-group (Qt)t>o leaves stable the diagonal A M 2d. 

Let xi, X2 € M. d , x\ 7^ X2- We define the stopping time 

T X1 , X2 :=inf{t> 0|Xf = X?}. 

Still by a strong uniqueness argument it is clear that {r XliX2 > t} = {Xf 1 / -Xf 2 } so that 

Qt({xi,x 2 ), c A R 2 d ) = lcA R2d (xi,x 2 )P(r xl)X2 > i) 

and Q t ((xi,xi), c A R 2d) = 0. 

Let jug Idsds be an extremal invariant measure. We have, for every t > 0, 

M( c A M 2d) = / /x(da;i,rfa;2)P(r XljX2 > i). 

Letting t go to +00 yields 

n( c A R 2d) = / fi(dx 1 ,dx 2 )F(T Xl>X2 = +00) 

so that, on c A R2 d, /J,(dx±,dx2)-a.s., P(t x1iX2 = +00) = 1 or equivalently the process 
(X X1 ,X X2 ) lives in c A R 2d. Consequently, if /i( c A R 2d) G (0,1), both conditional measures 
// A m2d and // A K 2d are invariant distributions for (SDSD) as well. If so, 

H = n( c A R2d )fi CA ^ + M (A MM )/A M 

cannot be extremal. Consequently ^(A R 2d) = or 1. 

Step 2: Let /j, be an extremal distribution in Idsds an d assume that / u( c A IR 2d) = 1. We 

will prove that this yields a contradiction under Assumptions (i) and (ii). 

To this end, let us introduce the pseudo-scale C 2 -function fg and its companion gg defined 

by 

eh ~ dw di and gg(u) = uf' e {u). 

Note that gg > 0, that Assumption (i) reads lim sup u _ >0 + 9e ( u ) < +°° and that g'g{u) = 
fg(u)(l — 0(u)). Moreover, if Assumption (ii) is fulfilled, so is the case by any continuous 
function 9 satisfying 8 > 6. As a consequence, we may modify 6 on [1, +00) so that 8 still 
satisfies (ii) and > 1 over [2e, +00). Then the function gg is nonincreasing on [2, +00). 
Consequently, without loss of generality, we may assume in the sequel of the proof that 

supgg(u) < +00. (3.8) 
u>0 

We now define a (Lyapunov) function ip : c A R 2d — > M by 

<p(vi,V2) ■= fe(\yi - V2\ 2 S )- 

We know from Step 1 that n(dx\,dx2)-a.s., (Xp / X* 2 for every t > 0) a.s.. Then, fg 
being a C 2 -function, we derive from Ito's formula applied to ip(Xp , X^ 2 ) that fj,(dx\,dx2)- 
a.s., 

<p{X?\X?) = <p(x 1 ,x 2 ) + f A (2) <p(X*\X* 2 )ds 

Jo 

+ f f e (\X? - X* 2 \ 2 s ){(a*(X^) - a*(X?))S(X? - X?)\dW 9 ) 
Jo 

. ' 

= :Mt local martingale 
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where, for every (x\,x 2 )£ (M. d ) 2 , 

A {2) if(x 1 ,x 2 ) = 2((6(xi) - b(x 2 )\x l - x 2 ) s + -\\a{ Xl ) - a(x 2 )\\ 2 s )f e (\x 1 - x 2 \ 2 s ) 

+ 2f' g '(\x 1 - x 2 \D\(a*( Xl ) - a*(x 2 ))S( Xl - x 2 )f . (3.9) 

Using that / satisfies the ODE = 6(£)f' g (£) + £#'(0 = 0, £ G (0, +oo) we deduce from 
Assumption (ii) that A^(p(x\, x 2 ) < if x\ ^ x 2 and = otherwise. 

On the one hand, since \i is extremal and since the sign of A® (p is constant, we can 
apply Birkhoff's theorem and obtain: 

fi(dx h dx 2 )-a.a., - [ A {2) y{X x s \ X x s 2 )ds f ^ + °°> I A {2) <pdfi G [-oo, 0] a.s. 

In fact, f c , A^(pdfi belongs to [— oo, 0) since A® ip is negative on c A K 2d and /x( c A K 2d) = 
1. On the other hand, using that gg is bounded and a is Lipschitz continuous, it follows 
that (Mt)t>o is an L 2 -martingale such that 



\M) t - I ge{\x s -x s \ ) — w — — w — 

A., — A. s Ac — A., „ 







2 ds < Ct 



where C is a deterministic positive constant so that ^ — > P^-a.s.. 
As a consequence, fJ,(dx\, dx 2 )-a.s., 



(3.10) 



lim ^r.^f 2 ) = /" ^(2)^ < a s 

t^+OO t IcA 



Hence, a.s., /(|Xf - X X2 \ 2 S ) = <p(X?\X? 2 ) -oo a.s. 

If lim u _ > o+ /e('u) > — oo, this yields a contradiction since fg is increasing on Otherwise 
\ x xi_ x x2\2 Q But a ppiy in g 

again Birkhoff's theorem, we obtain fj,(dx%, dx 2 )-a.s., 



I 



Vi ~ y2\ 2 s n{dyi,dy 2 ) = lim - / \X Xl - X X2 \ 2 ds = a.s., 

a t-s>+oo t Jo 



which contradicts the assumption /i( c A R 2d) = 1. Consequently, for any extremal distribu- 
tion /j,, we have /i(A K 2d) = 1. 

We can now prove Claim (a): by Krein-Millman's Theorem Isds is the weak closure 
of the convex hull of its extremal distributions. Consequently, the diagonal A K 2d being 
a closed subset of M. 2d , all invariant distributions of the duplicated system are supported 
by this diagonal. For any such invariant distribution /i, both its marginals are invariant 
distributions for (SDE). If (SDE) had two distinct invariant distributions v and u', we 
know from the introduction that Idsds would contain at least a distribution \i for which 
the two marginals distributions are /i(. xl d ) = v and fi(M. d X .) = u' respectively. As a 
consequence, such a distribution /i could not by supported by the diagonal A R 2d. Finally, 
1-SDE is reduced to a singleton {v} and Idsds = {^a}- 

Step 3 (Claim (b): Proof of (3.7)): Under the additional assumption on 8 of (b), we 
have lim u _ 5>0 + fe( u ) > ~°° an d thus, inf u> o fe(u) > — oo since fg is increasing. Let x%, 
x 2 G M. d . Using again that A^f < on c A K 2d where A^f is given by (3.9). It follows 
that (fg(\X Xl — X X2 \ 2 )) t>0 is a lower-bounded P-supermartingale. Thus, it a.s. converges 
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toward L^' X2 G L 1 (P). Using again that fg is increasing, it follows that |Xf J — X X2 \ 2 a.s. 
converges toward a finite random variable £^' X2 = /^(Lc 1 '" "~ 1 



■'00 



Now, using that for every x G f- / P s (x,dy)ds] is tight, we derive that 

t 7 /t>i 

/ Q s ((xi, x 2 ), (dyi, dy2))ds I is tight as well. Then the uniqueness of v& as an 
Jo ' t ^ 



t 

invariant distribution of Q implies that 



Q s ((xi,x 2 ), (dyi,dy 2 ))ds ^ is A . 



t 

Now for every bounded continuous function g : 



7 f Q s {g{\yi-y2\ 2 s ){xi,x 2 )ds = - [ E g (\x? - xf \ 2 s )ds — ► e 5 (C 

1 JO r JO 



X'2 - 



so that 



MO* 2 ) = / 5(|yi -y2| 2 >A(*/i,A/ 2 ) = 5 (0). □ 



.S' 



In Assumption (n) of the previous theorem, we see that the function (x, y) ^ \ (a* (x) — 
a*(y))S(x — y)\ plays an important role. In the sequel, we will obtain specific results when 
this function is not degenerated away from the diagonal. Such type of assumption will be 
called strong or regular directional S-ellipticity assumption. 

In the following proposition, we first show that when such an assumption is satis- 
fied, claim (b) of the previous theorem still holds without the tightness assumption on 
(i Jq P s (x,dy)ds)t>i (although it is a not really restrictive in our framework (see the 
fourth item of Remark 3.3)). 

Proposition 3.2. If the function 6 is (0, l]-valued and a satisfies the following strong 
directional S-ellipticity assumption away from the diagonal 

3 Q > 0, Vx, ye R d , \(a*(x) - a*(y))S(x - y)\ > a \x - y\ 2 , (3.11) 

then the conclusion of Claim (b) in the above proposition remains true without the tightness 
assumption on (| J Q * P s (x,dy)ds)t>i- 

Proof. First, we recall that under the assumptions of (b), we recall that (fg(\X X1 — 
X X2 \ 2 ))t>o is a lower-bounded P-supermartingale thus convergent to an integrable ran- 
dom variable and that this implies that (l-X^ 1 — X X2 \ 2 )t>o is a.s. convergent to a finite 
random variable i^ X2 (since fg is increasing). On the other hand, since <p is positive 

and fg is lower-bounded, we also have that 

f e (\X?-X?\ 2 g )- [ t A (2 ^(X x \X X2 )ds = i P (x 1 ,x 2 ) + M t 
Jo 

is a lower bounded P- (local) martingale starting at a deterministic starting value, hence 
converging toward an integrable random variable. Owing to the computations of (3.10) 
(which hold for every starting points x%, x 2 ), (Mt)t>o is i n f & ct an L 2 - convergent mar- 
tingale. Thus, (M)oo < +oo and taking advantage of the expression of this bracket (see 
(3.10)) and to Assumption (3.11), we derive that for every e > 

/»+oo 

J o (geiixf 1 - K 2 \ 2 s ) 2 i { ix^-xp\i>e} ds < +°° a - s - 
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The function gg is positive on (0, +00) and non-decreasing since g' e (u) = f' d (u)(l — 6(u)) > 
0. This implies that, for every e > 0, 

liminf ge{\X^ - X? 2 \ 2 s )l {lxxl __ XX2l < >> e} = a.s. 

Combined with the convergence of the squared norm this yields 

Ve>0, 9e{l%> X2 )\f^> £} = a.s. 
which finally implies £^' X2 = a.s. □ 

3.3 Global criterions, NILS exponent 

In this section and the following, we derive several corollaries of Theorem 3.2 illustrated 
by different examples. For S 6 S ++ (d, R), set 

[0\S,+ - SUp — — 2 • 



\-E y\ 



s 



Note that if [b]s.+ < +00 and if a is Lipschitz continuous, strong existence, pathwise 
uniqueness and the Feller Markov property hold for {SDE). 

Proposition 3.3. Assume \b]s,+ < +00, a is Lipschitz continuous and (SDE) and 
^sds is non empty and weakly compact. Let 8 : (0, +00) — > M + be a continuous function 
such that Assumption (ii) of Theorem 3.2 holds. Then, 

(a) If there exists Eq > such that 0(u) < 1, u £ (0,£o], then (SDE) (1.1) and its 
duplicated system have v and v& as unique invariant distributions respectively. 

(b) If furthermore for every xG M. d , (| J Q * P s (x, dy)ds)t>\ is tight and if there exists k > 1 
and Eq£ (0, e~ f) such that 



V«€(0,s ], 0{u)< 1 + - , (3.12) 

V log u / 

the duplicated system of (SDE) is pathwise confluent in the sense of Theorem 3.2(b). This 
condition is in particular satisfied if there exists £q > and 9q£L (0,1) such that 

Vug(o,£ ], e( u )<e . 

Claim (a) is obvious. For (b), one checks that Jq eh ~^~ dw dv < +00 as soon as 
lim inf u __ >0 + \og(u)(6(u) — l) > 1 and the result follows. 

Remark 3.4. The simplest case where the preceding result holds is obtained when 6 = 0. 
In this case, the (S, 0)-confluence global condition reads: 

Vx, y£R d , x^y, (b(x) - b(y)\x - y) s + ~\\a( X ) - a(y)f s < 0. (3.13) 

In fact, this condition is also required at every (x,y), x 7^ y, such that (a*(x) — a*(y))S(x — 
y) = 0, whatever 9 is. In the next section, we will see that when (a*(x) — a*(y))S(x — y) 
never vanishes away from the diagonal, one can derive a "localized" version of the criterion 
outside the diagonal. 

The constant function 6=1 clearly satisfies the assumption in (a) of the above Propo- 
sition. This leads us to introduce an important quantity of interest for our purpose. 
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Definition 3.1. The non-infinitesimal S-Lyapunov exponent (NILS) is a function on 
M. 2d \ A R 2d defined for every x, y€ M. d , x / y, by 

^_ (b(x)-b(y)\x-y) s l\W(x)-a(y)\\ 2 s f (a* (x) - a* (y)) S (x - y) 

A s y x >y) — \z — 775 r o 



\x-y\ 2 s 2 



\x - y\l 



Corollary 3.2. Assume b and a are like in Proposition 3.3 andlsDE is non empty 
and weakly compact. 

(a) Negative NILS exponent: if 

Vx, ye R d , x^y, X s {x,y) < 0, 

then (SDE) and its duplicated system have v and as unique invariant distributions 
respectively. 

(b) Negative NILS exponent bounded away from 0: // furthermore (j JjJ P s (x, dy)ds)t>i is 
tight for every x£ W 1 or a satisfies (3.11) and if there exists cq > such that 

Vx,yeR d , x^y, \x - y\ 2 s < e X s (x, y) < -c (3.14) 

then the duplicated diffusion is pathwise confluent i.e. 

Vx 1 ,x 2 £M d , X x t x -X x t 2 — >0a.s. as t +oo. 

Proof, (a) follows from claim (a) in the above proposition with 6 = 1. (b) follows from 
claim (b) in the same proposition. □ 

Note that in (b), Condition (3.14) can be replaced by the sharper following one: for all 
i,i/6 M. d , X s (x, y) < and there exists k > 1 and £q E (0, e~2 ) such that for all x, y 6 M d 

such that \x - y\ s < e , X s (x,y) < - — - r^-. 

log(\x-y\ 2 ) 

3.4 Applications and extensions 
3.4.1 Localization around the diagonal 

By local we mean that the (S, #)-confluence condition will be effective only in the neigh- 
bourhood of the diagonal A K 2d. The price to pay is a regular directional ellipticity as- 
sumption on o~(x) — o~(y) in the direction S(x — y) away from the diagonal. 

Proposition 3.4. Assume [b]s,+ < +oo, a is Lipschitz continuous and (SDE) admits 
at least one invariant distribution v. If there exists eq > such that 

( (i) Directional S- ellipticity: r/o := inf j|(cr*(x) — a*(y))S(x — y)\, \x — y\ s > £oj > 0, 
[ (ii) Locally negative NILS exponent: Vx, y£ M. d , < \x — y\ s < £q, X s (x,y) < 0, 

then (SDE) (1.1) and its duplicated system still have v and z/a as unique invariant dis- 
tributions respectively. 

Proof. For every u£ (0, +oo) 

(b( x )-b(y)\x-y) s + \\\a(x)-a(y)\\ 2 s 

SUp 2 < "( U ) 

\x-y\ s =u 



(a*(x)-a*(y))S 



* — 1/| 
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where 



6{u) 



1 if«€(0,e /2) 



where 5q is a positive number. Consequently 9 can be extended into a continuous function 
over (0, +oo) {e.g. by an affine extension between £q/2 and eo) which satisfies the condition 
of Proposition 3.3. □ 

Corollary 3.3 (Smooth coefficients). Assume the functions b and a are continuously 



differentiable. Let Jb(x) = ^jr:( x ) 



l<i,j<c 



denote the Jacobian ofb at x and let Vff(x) 



denote the gradient of a at x. If both SJb + JtS and V<r are Lipschitz 



continuous and if Vex is bounded then the NILS exponent is locally negative ( condition (ii) 
in Proposition 3.4) as soon as 



sup sup ( u* (SJ b (x) + Jb(x)S) u + (Va(x)\u) 
'■\u\ s =l 



(Va*(x)Su\u) 



< 



where, for every v = (v 1 , . . . ,v d ) 6 M. d , (Va(x)\v) = (Vcr^- (x)\v) 



l<i,j<d 



and Va(x)v 



dan 



^ dx k y 



.k=l 



x)v 



When S = Id, this may also be written 



J l<i,j<d 



sup sup u* J b {x) + j;{x) + (V^ii(a;))® 2 - 2\{Va ij {x)\u) \ UVa*j(x)\u) 

J |u|=1 V l<i,j<d 



u<0. 



We leave the computational details to the reader. 



3.4.2 The case X s < 

The previous results require the NILS exponent to be (strictly) negative (at least near the 
diagonal). When the invariant distribution v of (SDE) is unique and not degenerated, 
some results can be obtained when the NILS exponent is only non-positive and negative 
outside of a compact set. 

Proposition 3.5. Assume [b]s+ < +oo, a is Lipschitz continuous and (SDE) has a 
unique invariant distribution v whose support is not compact. Then, uniqueness for 
holds true as soon as 

Vx,y G M d , X s (x,y) < and 3 R > s .t. max(|x| g , \y\ s ) > R => \ s (x,y) < 0. (3.15) 

Proof. Let /j, be an extremal (hence ergodic) distribution \i for the duplicated system such 
that /u( c A R d) = 1 like in the proof of Step 2 of Theorem 3.2. Following this proof with 
8 = 1 (so that (p(xi,X2) = log(|xi — X2\ 2 )), we still have by the ergodic theorem that 

P M -o.s. jloglXf 1 -Xf 2 ! 2 — >c p := I X s (xi,X2)n(dx 1 ,dx 2 ). 

t JR 2d 

If < then both components are pathwise confluent, which implies following the proof 
of the theorem, that /u(A R 2d) = 1. Consequently c M = and then fi({X s = 0}) = 1 
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since A s is non-positive. Which in turn implies that /j, is compactly supported owing to 
the assumption made on the NILS exponent. On the other hand v being unique, both 
marginals of \i are equal to v. Hence the contradiction since the support of v is not 
compact. □ 

Remark 3.5. • In the particular case where a is constant, Condition (3.15) becomes a 
monotony condition on b (decrease with respect to (.|.) s at infinity), namely: 

Vx,yeR d , x + y (6(x) - b(y)\x - y) s < 0, 
and 3i?>0s.t. max(|x| s , \y\ s ) > R ==> (b(x) — b{y)\x — y)s < 0. 

This means that b is 5-non-increasing on R rf , S'-decreasing outside B\ _| (0; R) 2 . For in- 
stance, if b = — V£/, the above assumption holds if U is convex and (only) strictly convex 
outside of a compact set. 

• Note that when X7U is only increasing outside B\\ s (0;R) but possibly with no specific 
monotony on (0; R), it is still possible to find some diffusion coefficients a such that 
the SDE dXt = — VC/ (Xt)dt+a(Xt)dWt remains weakly or pathwise confluent. We refer to 
the next subsection for models with such stochastically stabilizing diffusive components. 

• Finally, note that the above condition (3.15) can be also localized around the diagonal 
under the directional S-ellipticity assumption. To be more precise, when v is unique and 
its support is not compact, Proposition 3.5 still holds if Assumption (ii) is "localized" 
into: 

(ii)i oc = for every x, y £ M. d such that < \x — y\ < Eq, Xs(x, y) < 0. 



3.4.3 Local criterion on compact sets 

As mentioned before, the diffusion is asymptotically (S, 0)-confluent (i.e. (S, #)-confluent 
in the sense of Theorem 3.2 with 9 = 0), if there exists S G S ++ (d, R) such that (3.13) 
holds. One asset of this more stringent assumption is that it can be localized in two ways: 
first in the neighbourhood of the diagonal like in the above local criterions, but also on 
compacts sets of M. 2d . This naturally leads to a criterion based on the differentials of b 
and a when they exist. 

Proposition 3.6 (Criterion on compact sets), (a) Let S G S ++ (d, R) such that for every 
R > 0, there exists 5r > such that \/x,y£ B\ | (0;R), 

0<\x-y\ s <5 R =>- (b(x) - b(y)\x - y) s + h\a{x) - a(y)f s < 0. (3.16) 
Then the diffusion is asymptotically (S 1 , 0)- confluent. 

(b) If b and a are continuously differentiate, then (3.16) holds as soon as 

{AC) di ff = Vxe R d , SJ b {x) + J 6 *(x)S+v / S^(Vf7 ij (2;))® 2 v / 5<0 in S(d,R). 

Proof, (a) Let x,yG R rf such that x / y. Set R = max(|x| s , \y\ s ) and 

i 

x = x, Xi = x + — (y - x), i = 1, . . . , N - 1, x N = y 
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where \y — x\ s < N5r. Then for every i G {1, . . . , N}, \xi\ s < R and \xi — Xi-\\ s < 5r. 
Then 



N 2 N 



\<j(x) - a(y)\\ 2 s = \\^2<T(xi) - cr(xi-i) < N s ^\\(j(x i ) - a(xi-i 



s 

i=i i=i 



N N 

< -2Nj2(b(xi) -b(xi-i)\xi -Xi-i) 8 = -2^2 (b(xi) - b(xi-!)\y - x) t 

i=i i=i 

< -2{b{y)-b{x)\y-x) s . 

(6) First, we prove the result when S = Id- We note that, for every continuously 
differentiable function g : R d — > R, g(y) — g(x) = (Vg(x + t(y — x))\y — x)dt = 
f (y — x)*Vg(x + t(y — x))dt so that 

(b(y)-b(x)\y-x) = / (y-x)*J b (x+t(y-x))(y-x)dt = / (y-x)* J b *(x+t(y-x))(y-x)dt 
Jo Jo 

and 

2 



\a(y) - a(x)\\ 2 = £ ( f 



(V<rij(x + t(y - x))\y - x)dt 
By Schwarz's Inequality and the fact that (u\v) 2 = u*v® 2 u, we deduce 
(b(y) - b(x)\y -x) + h\o{y)-a{x)f < ± jf ((y - x)*(J b + J fe *)(x + t(y - x))(y - x) 

+ \ X> - X T ( VcJ ^ + Kv - x))f 2 (y - 

This competes the proof when S = Id- This extends to general matrix S& S ++ (d, R) using 
that \\a(y)-a(x)\\ 2 s = \\(VSa)(y) - (VSa)(x)\\ 2 and the fact that (Au) m = Au m A* with 
A = y/S. □ 

3.5 Examples 

3.5.1 An example of confluent diffusion with increasing drift 

Assume that a : R d ->■ M(d, d, R) is defined by cr(a;) = x <g> A + cr° where cr £ A4(d, d, R) 
and A : M. d — > M. d is a bounded Lipschitz function (such that a is Lipschitz too). If there 
exists p£ (0, |) such that 

{b{x)\x) - p\x\ 2 \X{x)\ 2 . . 

hmsup v Wl ; — n 1 1 ^ 71 = -co (3.17) 
M-H-oo (1 + |x| 2 ) p+ 2 

then the diffusion (1.1) has at least one invariant distribution v. Thus, if 

rLl0 (b(x) -b(0)\x) . 

6 + = sup , , ? + +oo, 
x^O W\ 

the above condition is satisfied as soon as 

liminf |A(x)| 2 > 2[b]° + . 

\x\— >+oo 
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The key is to introduce the Lyapunov function V(x) = (a + |x| 2 ) p+ 2. Using that \\(a — 
a°)(x)\\ 2 = \(a-a°)*(x)f { \ 2 = |A(x)| 2 |:e| 2 , we deduce that 

\h^)t - (p + \)\o>(x)^[ = "P|A(x)| 2 |x| 2 + 0(1) 

and it follows that ]haswp\ x \^. +00 AV(x) = — oo if (3.17) is fulfilled (where A denotes the 
infinitesimal generator of (1.1)). 

If the function A is constant, the diffusion is asymptotically pathwise confluent (so that 
v is unique for (1.1) and the duplicated system has v& as unique invariant distribution) 
as soon as there exists Eq > satisfying 

\x - y\ < e =>■ (b(x) - b(y)\x - y) - ^|A| 2 |x - y\ 2 < 0. (3.18) 

This is a consequence of Proposition 3.4 applied with S = Id (the directional ellipticity 
assumption (i) is clearly true since |(<r*(x) — a*(y))(x — y)\ = \X\.\x — y\ 2 ). If b is smooth 
this condition is satisfied as soon as, for every xG R d , \{J b + J b *)(x) < l\X\ 2 I d in S{d,R) 
(Jb(x) denotes the Jacobian matrix of b). 

3.5.2 Baxendale's model 

Let Hi = (X t ,Y t ) be the unique strong solution to the 2-dimensional SDE 

2 

dX t = (a-^)X t dt-(aY t -e x )dW t 

2 

dY t = [b - °-^)Y t dt + (aX t + 9 Y )dWt 
where W is scalar standard Brownian motion, a, b, a are real numbers satisfying 



ab < 0, a + b < 0, a > "'' 



a + b 



and 9 X , 9 Y £ R. When 9 X = 9 Y = 0, this system is known as Baxendale's system 
(see e.g. [KP92]). Its stochastic stability has been extensively investigated in connection 
with its Lyapunov exponent. Then set 



A = A(ff) = b a + y/ ^ 2 a)2 + aA e (0, 1) and a = a 2 - (a + b) - ^{a-b) 2 + a^ > 0. 
and | • |a = | • | s with S = Diag(l, A). Ito's Lemma implies 

d\E t \\ = ( - a\E t \l + 9 X (9 X - 2aY t ) + X9 Y (9 Y + 2aX t ))dt + G(E t )dW t 



where @(x,y) = 2((A — l)cjxy + \9 Y x + 9 x y). It is clear that there exists /3 G R + such 
that 

\9 x (9 x -2ay) + X9 Y (9 Y +2ax)\ < P\(x,y)\ x . 
Then using that < J- + f an d setting j3' = (3 + we derive that 

d\E t \l<p'-^\E t \ 2 dt + &(E t )dW t 
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where < C|£|>. Hence, the function V(£) = is a Lyapunov function for the system 
since AV < j3' — %V. As a consequence there exists at least one invariant distribution v 
for the system and any such distribution satisfies v(V) < 2^ . 

At this stage we can compute the non-infinitesimal 5"-Lyapunov exponent of the du- 
plicated system. Tedious although elementary computations show that, for every £ = 
(x,y), e = (x',y')eR 2 , 

A S (C,C J = -- - (A - 1) a |g _^|4 < °- 

4 Application to the Richardson-Romberg extrapolation for 
the approximation of invariant distributions 

As an application, we investigate in this section the Richardson-Romberg {RR) extrapo- 
lation for the approximation of invariant measures. Roughly speaking, the aim of a RR 
method is generally to improve the order of convergence of an algorithm based on an 
discretization scheme by cancelling the first order error term induced by the time dis- 
cretization of the underlying process. However, to be efficient, such a method must be 
implemented with a control of its variance. We will see that in this context, this control is 
strongly linked to the uniqueness of the invariant distribution of the duplicated diffusion. 



4.1 Setting and Background 

4.1.1 Recursive computation of the invariant distribution of a diffusion: the 
original procedure 

Following [LP02] and a series of papers cited in the introduction, we consider here a 
sequence of empirical measures (z^(^> dx)) n >± built as follows: let (7 n )n>l denote a non- 
increasing sequence of positive step parameters satisfying 

n 

7n > and r n = 2^ 7fc > +oo. 

k=l 

We denote by (X n ) n >o the Euler scheme with step sequence (7 n )n>i defined by Xq = x£ 
R d and 

X n+ \ = X n + 7 n+ i6(X n ) + ^n+i a (X n ) U n+ i 

where (U n ) n >i is a sequence of i.i.d. centered M^-valued random vectors such that = I q 
defined on a probability space A, P). The sequence of weighted empirical measures 
(vn(u, dx)) n >i is then defined for every n > 1, by 

1 n 

V n(u, f) = jX^. lH 
n k=l 

where 5 a denotes the Dirac mass at a£ M. d and (%)fc>i is a sequence of positive weights 
such that H n = Ylk^Vk n ~ >+00 > +oo. When rjk = Jk which corresponds to the genuine 
case, we will only write i/ n (oj,dx) instead of Un(w,dx). For this sequence, we recall in 
Proposition 4.7 below in a synthesized form the main convergence results (including rates) 
of the sequence (un(uj, dx)) to the invariant distribution v of (Xt). In this way, we introduce 
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two assumptions: 

(S a ) : (a > 0) There exists a positive C 2 -function V : R d — > R with 

lim V(x) = +00, |W| 2 < CV, and sup ||D 2 V(x)|| < +00 

\x\-++oo agRd 

such that there exist some positive constants C&, j3 and a such that: 

(i) \b\ 2 < C b V a , Tr(aa*)(x) = o{V a (x)) as \x\ -»• +00 (iij (VV|6) < /3 - aF a . 

This Lyapunov-type assumption is sufficient to ensure the long-time stability of the Euler 
sheme (in a sense made precise below) as soon as a G (0,1]. Note that the convergence 
can be obtained under a less restrictive mean-reverting assumption including the case 
a = (see [Pan06]). The second assumption below is fundamental to establish the rate 
of convergence of (f„(w,/)) to v(f) for a fixed smooth enough function / : R d — > R: we 
assume that / has a smooth solution to the Poisson equation (see [PV01] for results on 
this topic). 

(C(f , k)): There exists a C fc -function g : R d — > R solution to / — u(f) = Ag such that /, g 
and its partial derivatives up to k are dominated by V r (r > 0): |/| < CV r and for every 

a = (at, ...,a d )£N d with \a\ := cti H h a d € {0, . . . , fc}, |d' "1 a d #| < CV r . 

x h '"'' Xi d 

Before recalling the results on (v n (cj,dx)), let us introduce further notations. We set 

n 

VrGN, rW=^7j 
fe=l 

and for a smooth enough function h : R d — > R and an integer r > 2, we write: 

DMfcOr) yi ® ■ ■ ■ ® y r = £ ^...^(x)^ 

(u,...,i r )e{i,...,(i}'- 

Proposition 4.7. Assume (S a ) /10/ds /or an a £ (0,1] and C7i G n p> oL p (P). Assume 
that (rjk/jk) is a non-increasing sequence. Then, 

(i) For every non-increasing sequence (0 n )n>i swc/i i/iat Xm>l ^n7n < +°° an d f or every 
r>0, En>i e nlv E[V r {X n )\ < +00. 

(u) For every r>0, sup n>1 fn(w, y r ) < +00 a.s. In particular, (vn{oj,dx)) n >\ is a.s. tight. 
(Hi) Every weak limit of {iQioj, dx)) n >i is an invariant distribution for (X t ) t >Q. Further- 
more, if (SDE) has a unique invariant distribution, say v, then i>n(u),f) ra ~ 5>+ °°> v(f) a.s. 
for every v-a.s continuous function f such that \ f\ < CV r for an r > 0. 

(iv) (Rate of convergence when rjk = jk) : Assume that v is unique and that E[C/® 3 ] = 0. 
Letk>\ such that f : R d -> R satisfies (C(f,k)). Then, 

.Ifk = 4and?£L^^0, 

v/f; (u n (u, f) - !/(/)) ^> A/-(0; /" |a*Va| 2 aV) as n -> +00. 
• Ifk = 5 and ^£ "^ + °°> /3 G (0,+oo], 

> >/r n (i/ n (w, /)-!/(/)) M-tf(pmV>;J |<T*Va| 2 av) as n -> +00 if /3 G (0,+oo), 

> -^y (f n (w, /) - v(f)) —> mjp as n -> +00 if /3 = +00 
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where = ip\dv with 



9 

<pi(x) = \D 2 g(x)b{xf 2 + iEpyx)&(x)( ff (x)I7i)® 2 ] + ^E^CxJKx)^)® 4 ]. (4.19) 

The first three claims part (i), (ii) and (Hi) of the theorem follow from [LP03] whereas 
the (iv) is derived from [LP02] (see Theorem 10) and [Lem05] (see Theorem V.3), in which 
the rate of convergence is established for a wide family of weights (%). 

Applying (iv) to polynomial steps of the following form: 7„ = Cn~^, p,E (0,1], we 
observe that the optimal (weak) rate is n _1//3 and is attained for \i = 1/3. Then 



/3 = V6Ci and ~ \/3C/2n3. 

so that 

n*(y n (u,,f)-v<J)) B>M(2Cm^;^ j \a*Vg\ 2 dv). 

This corresponds to the case where the rate of convergence of the underlying diffu- 
sion toward its steady regime (\/Tn corresponding to yt in the continuous time setting, 
see [Bha82] for the CLT for the diffusion itself) and the discretization error are of the same 
order. From a practical point of view it seems clear that a balance should be made be- 
tween the asymptotic bias and the asymptotic variance to specify the constant C. Under 
slightly more stringent assumptions we prove that the L 2 -norm of the error v n (uj, /) — v(f) 
satisfies 

K(u,,/) - u(f)\\ L2 ~ n-^4C 2 (mi 1) )2 + A J \ a *v g \2 du . 
An optimisation with respect to C gives the optimal choice C = ( 12 ^ Rd 5 . 

V (rrig ') 2 J 

When fj, £ (0,1/3), the step sequence decreases too slowly and the error induced by 
the time discretization error becomes prominent. That is why we propose below to use an 
RR extrapolation in order to cancel the first-order term in the time discretization error: 
in practice this amount to killing the bias nig in order to extend the range of application 
of the rate \fT^ (which corresponds to the standard weak rate y/i in Bhattacharia's CLT) 
to "slower steps". 



4.1.2 The Richardson-Romberg extrapolated algorithm 

As mentioned before, the starting idea is to introduce a second Euler scheme with step 
sequence (7 n )n>i defined by 

Vn > 1, 72(n-l) = 72n = y • 

As concerns the white noise of both schemes, our aim is to make them consistent in 
absolute time and correlated (with correlation matrix p satisfying I q — p*p£ S + (d, R)). 
To achieve that we proceed as follows. 

Let (Z n ) n >i be a sequence of i.i.d. Revalued random vectors lying in n p >o£ p (P) and 
satisfying 

E Z x = 0, E Zl = I q , E[Zf 3 ] = E[Zf 5 } = 0. 
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Then we devise from this sequence the white noise sequence (U n )ri>l of the "original" 
Euler scheme with step (7n)n>iby setting 

Vn > 1, U n = (Z 2n _x + Z 2n ) . (4.20) 

The white noise sequence for the second Euler scheme (with step (7 n )n>i)> denoted 
is defined as follows: 

Zfi = p*Z n + T(p)F n , n > 1, (4.21) 

where {V n )n>i is also a sequence of i.i.d. centered random variables in M q with moments 
of any order satisfying Ey a = I q and E[Vj® 3 ] = E[V]® 5 ] = 0, independent of (Z n ) n >i and 
Tq(p) is a solution to the equation 

T q (p)T q (p)* =I q -p*peS + (d,R). 

(T q (p) can be chosen either as the commuting symmetric square root of I q — p* p or its 
Choleski transform). Note that (Z^ )n>l is built in so that it satisfies 

£ 7 { P )=Iq and Cov(Z n , Z^) = p. 

Then the Euler scheme with step j n and consistent ^-correlated white noise (^) n >i, 
denoted (Y n (p) ) n >i from now on, is defined by: 

= y^ p) + 7nHY( p) ) + ^(FW)^, n > 1, Y = y. 

Also note that (X n ,Y$) is an Euler scheme at time T n of the duplicated diffusion 
(X t , X t p ) t > . 

For numerical purpose, one usually specifies the independent i.i.d. sequences (Z n ) n >\ 
and (V^) n >i as normally distributed so that they can be considered as the normalized 
increments of two independent Brownian motions W and W i.e. 

n = 7= and v n = -= , n > 1. 



'7n V7n 

Note that in this case, (U n ) is also a sequence of M(0, J g )-random variables. This implies 
in particular that 

E[£/f 4 ] = E[Z® 4 ] and E[E/f 6 ] = E[Zf 6 ]. (4.22) 

Since these properties simplify the result, we will assume them in the sequel of this section 
(see Remark 4.6 for extensions). 

We denote (v% (oj, dx)) n >i the sequence of empirical measures related to (Y^ (w)) n >i 
(in which the weights are adapted accordingly: r/i/2, 771/2, 772/2, 772/2, 773/2, . . . ). The em- 
pirical measure (i^J (^j dx)) n >\ associated to the Richardson-Romberg extrapolation is 
defined by 

1 n 

<> {p X",f) = jEt«d(«))+/*i(«))) 

k=l 

= ^ X> (/(^IdH) + /(4 p !iH) - /(**(")) 
n fc=i 
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Under the assumptions of Proposition 4.7, it is clear that (u,dx) > u(dx) 

a.s.. 

Thus, in the next section, we propose to evaluate the effects of the Richardson-Romberg 
extrapolation on the rate of convergence of the procedure and to explain why the unique- 
ness of the invariant distribution of the duplicated diffusion plays an important role in 
this problem. 



4.2 Rate of convergence of the extrapolated procedure 

Throughout this section we assume that r\k = Ik and so we will write u n , u n and u n p ^ in- 
stead of Un, u% and Un respectively. We also set (D 3 9i,.,.)i=i = D 2 (V )g in order that 
the notation Tr(cr* D 2 (V )ga) stands for the vector of R d defined by Tr(er* D 2 (V )ga) = 
(Tr((j* D 2 (d Xi g)a))f =1 . For a fixed matrix p, the main result about the RR extrapolation 
is Theorem 4.3 below. At this stage, we do not discuss the choice of the correlation p in 
this result. This point is tackled in Proposition 4.8 in which we will see that the optimal 
choice to reduce the asymptotic variance is atteined with p = I q as soon as u& is the 
unique invariant distribution of the associated duplicated diffusion. This emphasizes the 
importance of the question of the uniqueness of the invariant distribution in this pathologic 
case studied in the previous part of the paper. 

Theorem 4.3. Assume (S a ) holds for an a £ (0,1]. Assume that (X t , )t>o admits 
a unique invariant distribution p^ (with marginals u). Let f : M. d —¥ R be a function 
satisfying (C(f , 7)) and such that (p± defined by (4.19) satisfies (C(</?i,5)) with a solution 
to the Poisson equation denoted by g Vl . Then, 

;(„(f>( W) /)-*(/)) n ^M(0;a 2 ) 



where 



a*Vg\ 2 du-4 ((a*Vg)(x)\p(a*Vg)(y))p^(dx,dy). (4.23) 



// i-s — y (3 e (0, +oo], then 



Vf n (u n p \u, f) - =M- N(P mP;af) as n -> +oo if /3 G (0, +oo), 

■pi 

" (u n p) (io, f) - u(f)) ^ m {2) as n -> +oo if jfl = +oo, 



r (3) 



6 ~2(fc-3) 

<p 2 (x) = ^ -^n — E[D fc 5 (x)6(x) 0(6 - fc) (a(x)C/i)® 2 ( fc - 3 )] . (4.24) 

fc=3 

Remark 4.6. [> We recall that the result is stated under the assumption that the in- 
crements are normally distributed or more precisely under Assumption (4.22). When this 
additional assumption fails (think for instance to Z\ ~ fi(<L-i + <5i)) 9 ), the result is 

(2) 

remains true except for the value of m g which becomes more complicated since it also 
depends on E[Zf % t = 4 and 6). 
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> This result extends readily to general weights sequences (ry n ) n >i.Some technical condi- 
tions appear on the choice of weights but these conditions are natural and not restrictive 
(see [Lem05]). In particular we can always consider the choice r/ n = 1 for which we obtain 

the following result: if , r " } "~ S>+0 °> /3 G (0, +00), then 

vrj, l) 



(uW(uJ)-u(f)) ±±N{p m f ;a 2 p ) as n -> +00. 



> Polynomial steps. Let j n = Cn ^ /*G (0, 1]. If fi > |, l4 ' ->• ThJ < +00 so that 
-> as n -> +00. If < |, x n 1 ^ (and if // = |, ^£= X ^p). Consequently 

il 3) 1 ii 3) 1 if } ~ , 1 

<J=^ /i > -, — >• +00 <J=^ /U < - and — > p G (0, +00) <J=^ /i = -. 



\dn 5 yTV^ 5 vTn 5 

When /x= 1, /3 = C§V5 and y 7 ^ ~ ^pni 
As a consequence, if r y n = f]n = Cn~~s , 

4^ 



n§ ( V W( W) /)-,(/)) ^ AA(2C 2 mf; 



' 5 C, 

12 1 

We switch from a weak rate n5 to ns i.e. a "gain" of nis (see figure below). The second 
noticeable fact is that the bias is now significantly more sensitive to the constant C than 
in the standard setting. If we minimize the L 2 -norm of the error Un (a;,/) — v{f) we 
obtain the optimal choice of C as a function of both bias and standard deviation, precisely 



C 



,20(rr4 2) ) s 
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4.2.1 Optimal choice of p and uniqueness of p^ Id ^ 

Proposition 4.8. Let p be an admissible correlation matrix i.e. such that p*p < I q . 
Assume that the duplicated diffusion (X,X^) has a unique invariant distribution yy) (so 
that if p = I q , /i* 7 ^ = 



(a) a 2 > [ \a*Vg\ 2 dis. 

jR d 

(b) lfp = then al = 5 f \o*Vg\ 2 dv. 

JR d 

(c) Ifp = I q , a 2 = f \a*Vg\ 2 du. 

jR d 



Proof. Claims (6) and (c) being obvious thanks to (4.23), we only prove (a). Keeping in 
mind that both marginals p( p \~R d x dy) and p^ p \dx x (R d ) are equal to u, one derives 
thanks to Schwarz's Inequality (once on M. d and once on L 2 (p)) from the expression (4.23) 



of the asymptotic variance a p that 



\o-*Vg\ 2 dv - 4 
o / \o-*Vg\ 2 dv-4: 



\0~ 



Vg\ 2 (x)^(dx,dy) 
a*Vg\ 2 {x)v(dx 



*Vg\ 2 (y)p^(dx,dy) 



\pa*Vg\ 2 {y)v{dy) 



> hi \<r*Vg\ 2 dv -4 [ \a*Vg\ 2 du= [ \a*Vg\ 2 du 
jR d Jm d Jm d 



where we used in the last inequality that \pu\ 2 < \u\ 2 . 

The previous result says that the structural asymptotic variance of the RR estimator 
is always greater than that of the standard estimator but can be equal if the Brownian 
motions are equal. This condition is in fact almost necessary. Actually, thanks to the 
Pythagorean identity, 

a 2 = 5 f \<T*Vg\ 2 dv + 2 [ \a*Vg(x) - pa*Vg(y)\ 2 p^(dx,dy) 
Jm d JR 2d 

\a*Vg(x)\ 2 v(dx) - 2 f \pa*Vg(y)\ 2 u(dy). 

jR 2d 

Then, since p* p < I q , a necessary condition to obtain a 2 = J Rd \o*Vg\ 2 dv is 

\po-*X7g(y)\ = \a*Vg(y)\ v(dy)-a.e. 
When p* p < I q , this equality can not hold except if a*Vg(y) = v(dy)-a.e. 



4.3 Proof of Theorem 4.3 
4.3.1 Preliminaries 

Without loss of generality, we assume that / satisfies v(f) = so that / = Ag under 
(C(f,k)). We denote by 7W the sequence defined by 7^ = 7^. 
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Lemma 4.1. Assume that f satisfies (C(f , 7)) and denote by g the solution to the Poisson 
equation Ag = f . Then, 

n 

r n 4'V,/) =2(g(Y 2n )-g(Y ))-(g(X n )-g(X ))-Y,Vlk(^ AM k ) ~ AM f) ( 4 - 25 ) 

k=i 

-El-El + N n + R n (4.26) 

where 

AM? = (V 5 (X fc _i)|<x(Jr fc _i)tf fe ), 

AM® = (Vg(Y 2{k _ 1) )\a(Y 2{k ^ 1) )Z 2k . 1 ) + (V g{Y 2k ^)\a{Y 2k ^Z 2k ), 

n 2 n 

^ = 2 E (y) M%-i)) +E^i(?2fc-i)|^-i]) -£)7&>i(**-i), 
fc=l fc=l 

n g n 

^ = 2 E (y) M%-i)) +E[ ¥ » 2 (y 2fc -i)|J r fc-i]) - ^7^2(^-0 

fc=l A;=l 

wii/i and (p 2 defined by (4.19) and (4.24), 
(N n ) is defined by 

n 

N n = ^2(AN(Y 2{k _ 1) ,Z 2k _ l ,'^) + AN(Y 2k _ 1 ,Z ~ L • — ' 1 
k=l 

where AN(x,U,j) = H(x,U,j) - E x [H(x, U,^)] and 

2 



H(x,U n ) = ^D 2 g(x)(a(x)Ur 2 + lY / Ct i >y*?D*g(x)b(x)® l (*(x)U) 



6 ^o 



+ k E 7^Ct e D*g(x)b( x r(o-(x)Ur^ + 7 3 £ ^^ 5 (x)6(x)^ 6 ^(a(x)[/)^. 

Finally, if (S a ) holds, the sequence (R n )n>i satisfies the following property: there exists 
r > suc/i that, a.s., for every n > 1, 

E[\AR n \\T n ^} < Cji (V r (X n ^) + ^ (%_!)) + ^ r (F 2ri _i)) (4.27) 
where AR n = R n — R n -\. 

Remark 4.7. The above decomposition is built as follows: the second term of (4.25) 
is the main martingale component of the decomposition whereas £ nj i contains the first 
order discretization error. Thanks to the Richardson-Romberg extrapolation, £ n ^% is in 
fact negligible when n — > +oo. When the step sequence decreases fast (Theorem4.3(i)), 
the rate of convergence is ruled by the main martingale component. In Theorem 4.3(u), 
the rate is ruled by £ nj i and £ n)2 . Finally, N n contains all the negligible martingale terms. 
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Proof. Owing to (C(f , 7)), to the Taylor formula and to the fact that E[D 2 (x)(a(x)U 1 ) m ] = 
Tr(a*(x)D 2 g(x)a(x)), we have 

g(X k ) = g{X k _{) + ikAgiXk-i) + \^AM M (4.28) 
+ \ {D 2 (X k ^)(a(X k ^)U k f 2 - E[D 2 (X k ^)(a(X k ^)U k r 2 \T k ^}) (4.29) 



+ D l 9(Xk-i) {lkKXk-i) + V^°(Xk-i)U k f l (4.30) 



1=3 

+ D 7 g{i k ) ( lk b{X k _ x ) + VwK^fc-iW 7 (4.31) 



where £ fc G [X k _ l7 X k ]. The fact that |W| 2 < CV implies that W is a Lipschitz con- 
tinuous function with Lipschitz constant denoted by [-v/V^ip- Then, setting ||D 7 g(2;)|| = 
sup| Q | =7 \d a g(x)\ and using Assumption (C(f, 7)), we have 

\\D 7 g(H k )\\ < C(W(£ k )) 2r < C{y/V{X k _ x ) + [W] Lip \AX k \) 2r (4.32) 

where AX k = r y k b(X k _i) + y / ; y k o'(X k _i)U k . Then, owing to the elementary inequality 
\a + b\ p < c p (\a\ p + \b\ p ) and to Assumption (S a ), it follows that there exists r > such 
that 

E[|Ai?n||.7Vi] < C 7 lv r {X k ^). 

Then we plug this control into the above Taylor expansion and to compensate the terms 
of (4.30) when necessary. An appropriate (tedious) grouping of the terms yields: 

j k Ag(X k ^) = g{X k ) - g{X k ^) - ^AM M 

-7^i(X fc _ 1 )- 7 | V2 (X fe _ 1 )- AN{X k _ x ,U k ^ k )- AR n 

where R n> 2 satisfies (4.27). Making the same development for Ag(Y 2 [ k - X )) and for Ag(Y2 k -i) 
and summing over n yield the announced result. □ 

Lemma 4.2. Let a £ (0, 1] such that (S a ) holds. Assume that {X u X { t p) )t>o admits a 
unique invariant distribution fjS p \ Let g be a ^-function such that \Vg\ < CV r where 
r€R + . Then, 



1 . _ 

-^EV7 fc (^2AMf -AM, 



k > ==> °p- 



Proof. Let {£ k>n , k = 1, . . . , n, n > l}be the triangular array of (J-fc)-martingale increments 
defined by 



Let us show that 



k=l 

First, using that = L q , we obtain that for every k > 1, 

EOAM^lVfc-i] = k*V<?(AV-i)| 2 . 
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Since x h-> \a*Vg\ 2 (x) is a continuous function such that |<r*Vg| 2 < CV r for a positive r, 
it follows from Proposition (4.7) that 

— ^TfcEtlAM^lVfe-i] / k*V^| 2 (x)«/(dx). (4.33) 

n fe=i ^ 

Similarly, 

E[\AMjp\ 2 \T k -i] = |(7*V & (y 2(fc _i))| 2 + E[|a*V ff (Y 2ifc _i)| 2 |.7V 1 ]. 
It follows that 

1 n 1 n 

-^ 7fc E[|AMf |V*-i] = 2^)(a;,|a*V 5 | 2 ) - =- J> (4.34) 
1 n fc=i 1 n fc=i 

where (£&) is a sequence of (J-fc)-martingale increments defined by 

Cfc = 7fc (k*V 5 (y 2fc _!)| 2 - E[|a*V 5 (F 2 fc-i)| 2 |^-i]) • 

Using that |<r*Vg| 2 < CV r for a positive real number r, we obtain by similar arguments to 
those used in (4.32) that E[|C fc | 2 |JV-i] < Cy 2r (y 2(fc _ 1) ). We derive from Proposition 4.7(») 
applied with 8 k = i that 

^ L. 



5>[£ |7- fc _ 1 ]<C 7 i^^y 2r (y 2M )<+oo 



fc=l fe=l * fc 



since 

As a consequence (X^fe=i r^~)n>i is a convergent martingale and the Kronecker Lemma 

then implies that jJ- X^fc=i Cfc w " >+00 > a.s. Thus, we deduce from (4.34) combined with 
Proposition 4.7 that 

1 n 

^2^n\AMi 2) \ 2 \T k ^}^^2u(\a*Vg\ 2 ) a.s. (4.35) 
Ln k=i 

Finally, we have to manage the cross-product: keeping in mind the construction of the 
noises of the Euler schemes (see (4.20) and (4.21), we have: 

VmiAM^AMk^Tk-t] = ((<7*V^)(^jfc_i)|p(<r*V5)(y 2(fc _i))) 
+ ((cr*Vg)(X k _ 1 )\p(a*Vg)(Y 2k -i)) - T^Cf 

where 

Cf = Ik (((a*V 5 )(X fe _i)|p(a*V 5 )(Y 2 ^ 
so that 

1 n i n 

- £ V2E[amWam, >2 |j w ] = M W(^) + WO - E ^ ( 4 - 36 ) 
in fc=l in fc=l 
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where ip : M. 2d — > R is defined by ip(x,y) = (a*\/g(x)\p(a* J \/g)(y)) and for every Borel 
function / : R 2d -> R, 

1 n n 

A4 1) (/) = p-Z)^(^*-i' :P 2(*-i)) and /^ 2) (/) = r-Z)^/(^-i'^-i)- 
1 n fc=i 1 n it=i 

By straightforward adaptations of the proof of Proposition 4.7, we can show that if 
(Xt,xjf^) has a unique invariant distribution then, for every continuous function 
/ such that / < CV r with r > 0, 

jU «( a;)/ )^±^ M (p)( /) a . s . with i = 1,2. 

As a consequence, Hn\^) + fJ>n (ip) n ~ 5>+00 > 2fi(tp) a.s. Finally, by martingale arguments 
similar to those used for (Ck), one checks that T" 1 Y^l=i Cfc^ n ~ >+ °°> a.s. Thus, by (4.33), 
(4.35) and (4.36), we obtain that 

n 

Y.nikA^k-i] ^(W*Vg\ 2 ) - = a 2 . 

k=\ 

Then, the result follows from the CLT for arrays of martingale increments provided that 
a Lindeberg-type condition is satisfied (see [HH80], Corollary 3.1). To be precise, it is 
enough to prove that there exists 5 > such that 

n 

^E[|e M | 2+<5 |-F fe -i]^^0 a.s. (4.37) 
fc=l 

Using Assumption (S a ) and the fact |Vg| < CV r (r > 0), one can check that there exists 
r > such that 

J- n 

Thus, 

k=l ln 

Checking easily that > 0, (4.37) follows from Proposition 4.7(h). □ 

Lemma 4.3. Lei a G (0, 1] suc/i f/iaf (S a ) holds. Assume that (Xt) admits a unique 
invariant distribution v. Assume (C(f, k)) and that rffl w ~ >+ °°> +oo. Then, 
(i) If ^ i defined by (4.19) satisfies (C(</?i,5)) then, 

1 c -^+00 1 (i) 
J- n 

fn,) If the derivatives of g up to order 6 are continuous and dominated by V r (with r > 0), 

1 „ -^+00 1 , . 

C n ,2 > —ZVVPV a - s - 



1 n 
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Proof, (i) Writing 

2 E (y) + E^iCFajfc-OITfc-i]) = ^ ^ (^i(? 2 (*-i)) + <M^-i))+]T ^AT, 

fc=i fc=i fe=i 

with ATfc being a martingale increment denned by AT& = Ef^i^fc-iJI-^fc-i] — ^lC^jfc-i), 
one obtains that 

n 2 

= 4 2 ) [(^.w - V )M - (uf } - + £ y AT *- 

fc=i 

Applying Theorem V.3 of [Lcm05] (which is an extension of Proposition 4.7(w) to general 
weights) with rjf, = 7^ and q* = 4, we obtain that 

r (2) ^ 

-^y(^n - f)(<Pi) °°> ™ 9w e R in probability. 
Similarly, applying this result to the Euler scheme with half-step, we have: 

r^ 2) (2) ( P ) 1 (2) ( P ) _> 1 

"TfrK^n ' P = o " 2 7 fc t(^n ' " ~ V )M\ n ^ + °°} -m g £ M in probability. 



Thus, it remains to show that the martingale term is negligible. We set Ok = ■ Using 

that (7^) is non-increasing, one checks that (6 n ) is non-increasing and that ^ #fc7fc < +00. 
Since \(p±\ < CV r with r > 0, it follows from Proposition 4.7 that 



k>l \ L k > 



This implies that the martingale 2J "(lyATfc is a.s. convergent so that the Kronecker lemma 
yields -737 Sfc=i TfcATfc n ~ 5>+0 °> a.s.. The first assertion follows. 
Remark that 

c 1 7 ( 3 >, (p) 7 ( 3 >/ A , 7fe^ 

£«,2 = ,W - < (W, p 2 ) + 2^ "J T fc- 

fc=l 

Under the assumptions, </?2 is continuous and dominated by V r with a positive r. Then, 
since rffl n ^ +oc > +00, {yl '^(<£>2))n>i and (1/^ (u, </?2))n>i converge to vfa)- With 
some similar arguments as previously, one checks that the martingale term is negligible 
and the second assertion follows. □ 



4.3.2 Proof of Theorem 4.3 

For the sake of simplicity, we choose to give the proof of Theorem 4.3 only when Y n > 

+00. Note that if 7„ = Cn~^, this corresponds to \x < 1/3, i.e. the case where the Romberg 
extrapolation really increases the rate of convergence (see Remark 4.6). 

By the decomposition of Lemma 4.1 and the convergences established in Lemmas 4.2 
and 4.3, one checks that it is now enough to prove the following points: 

^(2(g(Y 2n )-g(Y ) - (g(X n ) - X )) as n -> +00, (4.38) 
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^N n A and ^R n A as n -»• +00 (4.39) 

J- TV -L 71 



with 9 n = \/T^ V -^fy. 

For (4.38), the result is obvious when g is bounded. Otherwise, we use Lemma 3 of 
[LP03] which implies in particular that for every p > 0, E[F P (X„)] < C p T n . By Jensen's 
inequality, this implies that for every r > and a 6 (0, 1], there exists a constant C > 
such that 

Vn > 1, E[V r (X n )} < {E[V^(X n )\) a < ClT". 

a 

Thus, since the same property holds for the (Y n ) and since \g\ < CV r with r > 0, (4.39) 
follows taking a G (0, 1/2). 

For the first assertion of (4.39), we use a martingale argument. We denote by {irk,n,k = 
1, . . . ,n, n > 1} the triangular array of (J-fc)-martingale increments defined by 

AN k 



Then, in order to prove the convergence in probability of {N n / y/T\i) to 0, we use the CLT 
for martingale increments which says that, since a Lindeberg-type condition holds (we do 
not prove this point, see Proof of Lemma 4.2 for a similar argument), it is enough to show 
that 

n 

^E[|vr M |Vfc-i] ^±^0 a.s. (4.40) 
fc=i 

Under the assumptions on g and on the coefficients, one checks that there exists r > 
such that 

n 1 n 

J2n\«k, n \ 2 \ fk-i] < c T Y,il (^ r (^-i) + v r (Y 2{k _ 1} ) + vpn-i)) . 

k=l n k=l 

By Proposition 4.7, sup n>1 (v n (cj,V r ) + vi P \uj,V r )j < +00. Assertion (4.40) follows. 

As concerns R n , it follows from a martingale argument that 
1 n 

-^J^(Afl fc -E[A22 Jfe |.F fc _ 1 ]) AO asn^+oo. 

k=l 

Now, since sup n>1 \ Un (uj,V r ) + vZ ^ p \uj,V r )j < +00 a.s. and since 7„ "~ !>+00 > q, we 
deduce that 

1 ™ 

— T ^E[Ai2 fc |J- fc _i] AO. 



r (3) 

1 n fc=l 



The last assertion follows. 
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A Hypo-ellipticity of the correlated duplicated system 



It is a well-known fact that, for a Markov process, the strong Feller property combined 
with some irreducibility of the transitions implies uniqueness of the invariant distribution 
(see e.g. [DPZ96], Theorem 4.2.1). For a diffusion process with smooth coefficients, such 
properties hold if it satisfies the hypoelliptic Hormander assumption (see [H6r67, H6r85]) 
and if the deterministic system related to the stochastic differential system (written in the 
Stratanovich sense) is controllable. In fact, both properties can be transferred from the 
original SDE to the duplicated system so that its invariant distribution is also unique. 
The main result of this section is Proposition A. 9. Before, we need to introduce some 
Hormander- type notations. First, written in a Stratonovich way, X is a solution to 

q 

dX t = A (X t )dt + Y, A j{ x t)°dWi (A.41) 
i=i 

where Aq,...A„ are vectors fields on W 1 defined by 1 : 



d 

8=1 



2 

ho 



d,. 



and for every j G {1, . . . , q}: 

i=i 



For the sake of simplicity, we assume that b and a are C°° on M rf with bounded derivatives. 
We will also assume the following Hormander condition at each point: there exists JVgN* 
such that Vxel d , 

dim (Span {A 1 (x),A 2 {x), . . .,A q (x), L. B. of length < iV of the A,-(x)'s ,0 < j < q}) = d 

(A.42) 

where "L.B." stands for Lie Brackets. The above assumptions imply that for every t > 
and xS M d , Pt{x, .) admits a density pt{x, .) w.r.t. the Lebesgue measure and that (x, y) \-t 
Pt(x,y) is C°° on M. d x M. d (see e.g. [Cat92], Theorem 2.9). In particular, x ^ Pt(x, .) is a 
strong Feller semi-group. Assume also that the control system (associated with (A.41)) 

q 

i (B) = Ao(x^) +J2M xiu) ) u i> (A. 43) 

3=1 

is approximatively-contiollab\e: 

There exists T > such that for every e > 0, x±,x 2 G M d , there exists u G L 2 ([0,T],R d ) 
such that (x^ u \t)) solution to (A. 43) satisfies x(0) = x\ and \x(T) — x 2 \ < e. 

{AAA) 

Under Assumptions (A.42) and (A. 44), the diffusion has a unique invariant distribution 
v. Actually, the controllability assumption combined with the Support Theorem implies 



1 With a standard abuse of notation, we identify the vectors fields and the associated differential oper- 
ators. 
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that for every non-empty open set O, for every x G M. d , Pt{x,0) > 0. The semi-group 
(Pt) is then irreducible. Owing to the strong Feller property, it follows classically that (Pt) 
admits a unique invariant distribution (see e.g. [DPZ96], Proposition 4.1.1. and Theorem 
4.2.1.). 

Furthemore, v is absolutely continuous with respect to the Lebesgue measure on R rf and its 
topological support is M. d (since for every open set O of M d , v{0) = f Pr{x,0)u(dx) > 0). 
Let us now consider the duplicated diffusion (Xt,Xf'). Setting z[ p ^ = (X t ,X^) and 
using the preceding notations, (1.2) can be written: 

dZP = MZ { t p) )dt + £ A^fydWi + £ A d+j (Z^)dWi 

3=1 3=1 



where A (z) = (A (x), A (y)) T (with A (y) = £\ =1 Hv) ~ vijivWy^iAy) 
and z = (x,y)), W is a (/-dimensional Brownian Motion independent of W such that 
iy(p) = p *W + (I q - p*p)^W and for every j G {1, . . . , q}, 

A J (z) = A j (x)+A ( j p \y) and, A q+j (z) = Af^ p)h \y) (A.45) 

where for a for a q x q matrix B, Aj B \y) = Y2i=i( a (y)B)i,jd yi . Then, the following 
property holds. 

Proposition A. 9. Let p G M 9)9 (M) such that p*p < I q . Assume that b and a are C°° 
on ~K d with bounded derivatives. Assume (A. 42) and (A. 44). Then, uniqueness holds for 
the invariant distribution of the duplicated diffusion (Xf, X^). Furthermore, if 
exists, then has a density p^ (w.r.t. \2d) which is a.s. positive. 

Proof. First, let us check the Hormander conditions for (Xt, X^)t>o- Setting S = (I q — 
p*p)z, standard computations yield 

Vj G {1, ... ,q}, A q+j (z) = SijMv)- 

i=i 

Since S is invertible, we deduce that {Ai(y),l = l,...,q} belongs to Span{ A g+ j (z) , j = 
1, . . . , q}. Similarly, checking that for every j G {1, . . . , q}, 



[A (z),A g+j (z)] = [A (y),Af\y)] = j^S^Mv), Mv)\ 

i=i 



one deduces from the invertibility of S that {[Ao(y), Ai(y)],l = l,...,q} is included in 
Span{[Ao(z), A q+ j(z)], j = l,...,q}. Owing to (A. 42), it follows that Span{d yi , . . . , d Vd } 
is included in 

V = Span ^A 1 (z),A 2 (z), A q (z), Lie Brackets of length< N of the Aj(z)'s , < j < q\ 

Now, let us show that Span{9 Xl , . . . ,d Xd } is included in V. Since Spanjc?^, . . . ,d Vd } is 
included in V, it is clear that for every x G M. d , Aj(x) = Aj (y) — Aj(z) also belongs to 
V. Since 

[Mz),Mz)] = [A {x),A 3 {x)) + [A (y),A<f\y)}, 
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[Ao(x), Aj(x)] has the same property. Using again (A. 42), we deduce that Span{5 Xl , . . . ,d Xd } 
is included in V and thus that dim(V) = 2d. As a consequence, for every z G M. 2d and 
t > 0, Q^(z, .) admits a density qt(z,.) w.r.t. X 2 d such that (z,z ; ) h-» q t (z,z') is C°° on 

In order to obtain uniqueness for the invariant distribution, it remains to show that there 
exists T > such that for every z G M. 2d , for every non-empty open set O of M. 2d , 
Qt(z,0) > 0. Owing to (A. 44), it is clear that for every z\ = (xx,yi) and z 2 = (x2,y2), 
for every e > 0, there exist u and u G L 2 ([0, T], R d ) such that z(t) = (x (u) (t), x (a) (t)), 
where and x^ u ' are solutions to (A. 43) starting from x\ and yi, satisfies \z(T) — 
z%\ < e. Furthermore, since S is invertible, we can assume that u = pu + Slo with 
a; G L 2 ([0, T], M rf ). Then, the support Theorem can be applied to obtain that for ev- 
ery zi,Z2,£ Qt(zi, B(z2, I) > and thus to conclude that for every z G M. 2d and every 
non-empty open set O, Qt(z,0) > 0. □ 



B Additional proofs about the two-dimensional counterex- 
ample 

Proof of (3.5): For the sake of completeness, we show that rt — > 1 a.s. as soon as ro > 0. 
First, note that uniqueness holds for the solution of the SDE (3.3) since the coefficients 
are Lipschitz continuous. In particular, (rj) defined a.s. by r\ = 1 for every t > is the 
unique solution starting from ro = 1. Owing to the strong Markov property, this implies 
that if r 1 := inf{i > 0, rt = 1}, then r< = 1 on {r < i}. The same property holds at 0. 
We deduce that {r t )t>o lives in [1, +oo) if ro > 1 and in [0, 1] if ro G [0, 1]. Moreover, if 
r > 1, we have d(r t — 1) = —(r t — l)(dt + "9dW t ) so that 

r t -l = e -(i+^)t+W t _ 

It follows that limt-^+oo rt = 1 since linif_ 5 . +00 = a.s.. Now, if ro G [0, 1], we have 

dr t = r t {l-r t ){dt + MWt). 

Thus, (rt) is a [0, 1] -valued submartingale. In particular, r t converges a.s. to a [0, l]-valued 
random variable r^. Since 

Vi>0, E[r t ) =r + E^ r s (l-r s )ds), 

it follows that E[J" +o ° r s (l — r s )ds] which in turn implies that Jo + °° r s(l — r s)ds < +oo a.s. 
As a consequence lim inf f_j. +00 r t (l — r t ) = a.s.. The process (r 4 ) being a.s. convergent 
to roo, it follows that r ro G {0, 1} a.s.. It remains to prove that P(r QO = 0) = 0. Denote 
by p the scale function of (rt) null at r = 1/2. For every r G (0, 1), 

p(r) = I e J i = | \ df. 

As a consequence, if i? G (0, \/2], lim r _s. +00 p(r) = +oo. This means that is a repulsive 
point and that, as a consequence (see e.g. [KT81], Lemma 6.1 p. 228), 



V6 G (0, 1) P( lim r a < n) := lim P(r a < r b ) = 

a— >0+ a-»0+ 
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where T a = inf{i > | n = a}, y G [0, 1]. We deduce that P(roo = 0) = 0. This completes 
the proof. □ 

Proof of (3.6): We want to prove that [i is invariant for (Xf,Xf) if and only if fi can 
be represented by (3.6). First, since the unique invariant distribution of {Xf) is X$u it 
is clear that \x = £(e*®°, e*( 0o+ ^°)) where Oo has uniform distribution on [0,27r] and Vq is 
a random variable with values in [0, 2tt). One can check that if Vq is independent of Go, 
fi is invariant. Thus, it remains to prove that it is a necessary condition or equivalently 
that K(6,dv) := C(e iV °\e i0 ° = e ie ) does not depend on 9. Denote by {e i0t , e ^ 6t+v ^) the 
(stationary) duplicated diffusion starting from (e 1 ® , e l ^ 0+v °^). Since /x is invariant, we 
have for every t > 

C(e iVt \e iet =e ie ) = K(9,dv) 

but thanks to the construction, for every t > 0, Qt = ©o + Wt and Vt = Vq (the angular 
difference between the two coordinates does not change) so that 

C(e iVt \e iet = e ie ) = J K(6' ,dv)pt(0,d6') 

where p t {9,d0') = C{e l(e+W ^). But p t (0,d6') converges weakly to when t — > +oo. 
From the two previous equations it follows that K(6, dv) does not depend on 9 since 
V9 > 0, K(0,dv) = J K(9',dv)Xs 1 (d9 r ). □ 
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